HuggingFaceH4
/

zephyr-7b-gemma-v0.1

@@ -53,18 +53,16 @@ At the time of release, Zephyr 7B Gemma is the highest ranked 7B chat model on t
-In particular, on several categories of MT-Bench, Zephyr-7B-β has strong performance compared to larger open models like Llama2-Chat-70B:
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6200d0a443eb0913fa2df7cc/raxvt5ma16d7T23my34WC.png)
-However, on more complex tasks like coding and mathematics, Zephyr-7B-β lags behind proprietary models and more research is needed to close the gap.
 ## Intended uses & limitations
-The model was initially fine-tuned on a filtered and preprocessed of the [`UltraChat`](https://huggingface.co/datasets/stingning/ultrachat) dataset, which contains a diverse range of synthetic dialogues generated by ChatGPT.
-We then further aligned the model with [🤗 TRL's](https://github.com/huggingface/trl) `DPOTrainer` on the [openbmb/UltraFeedback](https://huggingface.co/datasets/openbmb/UltraFeedback) dataset, which contains 64k prompts and model completions that are ranked by GPT-4. As a result, the model can be used for chat and you can check out our [demo](https://huggingface.co/spaces/HuggingFaceH4/zephyr-chat) to test its capabilities.
-You can find the datasets used for training Zephyr-7B-β [here](https://huggingface.co/collections/HuggingFaceH4/zephyr-7b-6538c6d6d5ddd1cbb1744a66)
 Here's how you can run the model using the `pipeline()` function from 🤗 Transformers:
@@ -76,25 +74,35 @@ Here's how you can run the model using the `pipeline()` function from 🤗 Trans
 import torch
 from transformers import pipeline
-pipe = pipeline("text-generation", model="HuggingFaceH4/zephyr-7b-gemma", torch_dtype=torch.bfloat16, device_map="auto")
-# We use ChatML to format each message - see https://huggingface.co/docs/transformers/main/en/chat_templating
 messages = [
     {
         "role": "system",
-        "content": "You are a friendly chatbot who always responds in the style of a pirate",
     },
     {"role": "user", "content": "How many helicopters can a human eat in one sitting?"},
 ]
-prompt = pipe.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
-outputs = pipe(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
-print(outputs[0]["generated_text"])
-# <|system|>
-# You are a friendly chatbot who always responds in the style of a pirate.</s>
-# <|user|>
-# How many helicopters can a human eat in one sitting?</s>
-# <|assistant|>
-# Ah, me hearty matey! But yer question be a puzzler! A human cannot eat a helicopter in one sitting, as helicopters are not edible. They be made of metal, plastic, and other materials, not food!
 ```
 ## Bias, Risks, and Limitations

+In particular, on several categories of MT-Bench, Zephyρ 7B Gemma has strong performance compared to larger open models like Llama2-Chat-70B:
 ![image/png](https://cdn-uploads.huggingface.co/production/uploads/6200d0a443eb0913fa2df7cc/raxvt5ma16d7T23my34WC.png)
+However, on more complex tasks like coding and mathematics, Zephyr 7B Gemma lags behind proprietary models and more research is needed to close the gap.
 ## Intended uses & limitations
+The model was initially fine-tuned on the [DEITA 10K](https://huggingface.co/datasets/HuggingFaceH4/deita-10k-v0-sft)  dataset, which contains a diverse range of synthetic dialogues generated by ChatGPT.
+We then further aligned the model with [🤗 TRL's](https://github.com/huggingface/trl) `DPOTrainer` on the [argilla/dpo-mix-7k](https://huggingface.co/datasets/argilla/dpo-mix-7k) dataset, which contains 7k prompts and model completions that are ranked by GPT-4. As a result, the model can be used for chat and you can check out our [demo](https://huggingface.co/spaces/HuggingFaceH4/zephyr-chat) to test its capabilities.
 Here's how you can run the model using the `pipeline()` function from 🤗 Transformers:
 import torch
 from transformers import pipeline
+pipe = pipeline(
+    "text-generation",
+    model="HuggingFaceH4/zephyr-7b-gemma",
+    device_map="auto",
+    torch_dtype=torch.bfloat16,
+)
 messages = [
     {
         "role": "system",
+        "content": "",  # Model not yet trained for follow this
     },
     {"role": "user", "content": "How many helicopters can a human eat in one sitting?"},
 ]
+outputs = pipe(
+    messages,
+    max_new_tokens=128,
+    do_sample=True,
+    temperature=0.7,
+    top_k=50,
+    top_p=0.95,
+    stop_sequence="<|im_end|>",
+)
+print(outputs[0]["generated_text"][-1]["content"])
+# It is not possible for a human to eat a helicopter in one sitting, as a
+# helicopter is a large and inedible machine. Helicopters are made of metal,
+# plastic, and other materials that are not meant to be consumed by humans.
+# Eating a helicopter would be extremely dangerous and would likely cause
+# serious health problems, including choking, suffocation, and poisoning. It is
+# important to only eat food that is safe and intended for human consumption.
 ```
 ## Bias, Risks, and Limitations