Suparious commited on
Commit
ce6e58f
·
verified ·
1 Parent(s): d62a5cf

Add model card

Browse files
Files changed (1) hide show
  1. README.md +34 -0
README.md ADDED
@@ -0,0 +1,34 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - quantized
4
+ - 4-bit
5
+ - AWQ
6
+ - autotrain_compatible
7
+ - endpoints_compatible
8
+ - text-generation-inference
9
+ license: apache-2.0
10
+ language:
11
+ - en
12
+ base_model: mistral-community/Mixtral-8x22B-v0.1
13
+ model_creator: Vezora
14
+ model_name: Mistral-22B-v0.2
15
+ model_type: mistral
16
+ pipeline_tag: text-generation
17
+ inference: false
18
+ ---
19
+ # Vezora/Mistral-22B-v0.1 AWQ
20
+
21
+ - Model creator: [Vezora](https://huggingface.co/Vezora)
22
+ - Original model: [Mistral-22B-v0.2](https://huggingface.co/Vezora/Mistral-22B-v0.2)
23
+
24
+ ## Model Summary
25
+
26
+ - Just two days after our release of **Mistral-22b-v0.1**, we are excited to introduce our handcrafted experimental model, **Mistral-22b-v.02**. This model is a culmination of equal knowledge distilled from all experts into a single, dense 22b model. This model is not a single trained expert, rather its a compressed MOE model, turning it into a dense 22b mode. This is the first working MOE to Dense model conversion.
27
+ - v0.2 has trained on 8x more data than v0.1!
28
+
29
+ ## How to use
30
+
31
+ **GUANACO PROMPT FORMAT** YOU MUST USE THE GUANACO PROMPT FORMAT SHOWN BELOW. Not using this prompt format will lead to sub optimal results.
32
+
33
+ - This model requires a specific chat template, as the training format was Guanaco this is what it looks like:
34
+ - "### System: You are a helpful assistant. ### Human###: Give me the best chili recipe you can ###Assistant: Here is the best chili recipe..."