wwe180 committed on
Commit
179fed7
·
verified ·
1 Parent(s): 1c91384

Update README.md

Files changed (1)
  1. README.md +35 -27
README.md CHANGED
@@ -1,18 +1,21 @@
1
  ---
2
  base_model:
3
- - openchat/openchat-3.6-8b-20240522
4
- - hfl/llama-3-chinese-8b-instruct-v2-lora
5
- - Sao10K/L3-8B-Stheno-v3.1
6
- - Jiar/Llama-3-8B-Chinese
7
- - NousResearch/Hermes-2-Theta-Llama-3-8B
8
- - camillop/Meta-Llama-3-8B-ORPO-ITA-llama-adapters
9
- - NousResearch/Meta-Llama-3-8B-Instruct
10
  library_name: transformers
11
  tags:
12
  - mergekit
13
  - merge
14
-
 
 
15
  ---
 
 
 
 
 
 
 
16
  # merge
17
 
18
  This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
@@ -29,22 +32,27 @@ The following models were included in the merge:
29
  * [Sao10K/L3-8B-Stheno-v3.1](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.1) + [Jiar/Llama-3-8B-Chinese](https://huggingface.co/Jiar/Llama-3-8B-Chinese)
30
  * [NousResearch/Hermes-2-Theta-Llama-3-8B](https://huggingface.co/NousResearch/Hermes-2-Theta-Llama-3-8B) + [camillop/Meta-Llama-3-8B-ORPO-ITA-llama-adapters](https://huggingface.co/camillop/Meta-Llama-3-8B-ORPO-ITA-llama-adapters)
31
 
32
- ### Configuration
33
-
34
- The following YAML configuration was used to produce this model:
35
-
36
- ```yaml
37
- slices:
38
- - sources:
39
- - model: "Sao10K/L3-8B-Stheno-v3.1+Jiar/Llama-3-8B-Chinese"
40
- layer_range: [0, 22]
41
- - sources:
42
- - model: "NousResearch/Hermes-2-Theta-Llama-3-8B+camillop/Meta-Llama-3-8B-ORPO-ITA-llama-adapters"
43
- layer_range: [10, 22]
44
- - sources:
45
- - model: "openchat/openchat-3.6-8b-20240522+hfl/llama-3-chinese-8b-instruct-v2-lora"
46
- layer_range: [10,32]
47
- merge_method: passthrough
48
- base_model: "NousResearch/Meta-Llama-3-8B-Instruct"
49
- dtype: bfloat16
50
- ```
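For reference, the depth of the stacked model implied by the removed passthrough configuration can be worked out by summing the slice lengths. The sketch below is only an illustration: it assumes each `layer_range` is half-open (`[start, end)`), and the `stacked_layer_count` helper is a hypothetical name, not part of mergekit.

```python
# Slices copied from the removed YAML configuration: (model spec, layer_range).
slices = [
    ("Sao10K/L3-8B-Stheno-v3.1+Jiar/Llama-3-8B-Chinese", (0, 22)),
    ("NousResearch/Hermes-2-Theta-Llama-3-8B+camillop/Meta-Llama-3-8B-ORPO-ITA-llama-adapters", (10, 22)),
    ("openchat/openchat-3.6-8b-20240522+hfl/llama-3-chinese-8b-instruct-v2-lora", (10, 32)),
]


def stacked_layer_count(slices):
    """Sum the slice lengths, assuming half-open [start, end) layer ranges."""
    return sum(end - start for _, (start, end) in slices)


if __name__ == "__main__":
    for spec, (start, end) in slices:
        print(f"{end - start:>3} layers  <- {spec}")
    print("total layers:", stacked_layer_count(slices))
```

Under that assumption the three slices contribute 22, 12, and 22 layers, which is consistent with the "13B" in the model name given that each source model is an 8B Llama 3 variant.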
 
 
 
 
 
 
1
  ---
2
  base_model:
3
+ - wwe180/Llama3-13B-lingyang-v1
 
 
 
 
 
 
4
  library_name: transformers
5
  tags:
6
  - mergekit
7
  - merge
8
+ - Llama3
9
+ license:
10
+ - other
11
  ---
12
+
13
+ # The model is experimental, so the results cannot be guaranteed.
14
+
15
+ In brief informal testing, the model performed well and appeared stronger than llama-3-8b.
16
+
17
+
18
+
19
  # merge
20
 
21
  This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
32
  * [Sao10K/L3-8B-Stheno-v3.1](https://huggingface.co/Sao10K/L3-8B-Stheno-v3.1) + [Jiar/Llama-3-8B-Chinese](https://huggingface.co/Jiar/Llama-3-8B-Chinese)
33
  * [NousResearch/Hermes-2-Theta-Llama-3-8B](https://huggingface.co/NousResearch/Hermes-2-Theta-Llama-3-8B) + [camillop/Meta-Llama-3-8B-ORPO-ITA-llama-adapters](https://huggingface.co/camillop/Meta-Llama-3-8B-ORPO-ITA-llama-adapters)
34
 
35
+ ## 💻 Usage
36
+
37
+ ```python
38
+ # Install the dependencies first: pip install -qU transformers accelerate
39
+ import torch
40
+ import transformers
41
+ from transformers import AutoTokenizer
42
+
43
+ model = "wwe180/Llama3-13B-lingyang-v1"  # Hub ID taken from base_model above
44
+ messages = [{"role": "user", "content": "What is a large language model?"}]
45
+
46
+ tokenizer = AutoTokenizer.from_pretrained(model)
47
+ prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
48
+ pipeline = transformers.pipeline(
49
+     "text-generation", model=model, torch_dtype=torch.float16, device_map="auto"
50
+ )
51
+
52
+ # Generate a reply; max_new_tokens=256 is an illustrative limit, not a tuned value.
53
+ outputs = pipeline(prompt, max_new_tokens=256)
54
+ print(outputs[0]["generated_text"])
55
+ ```
56
+ ## Statement:
57
+
58
+ Llama3-13B-lingyang-v1 does not represent the views or positions of the model developers. We will not be liable for any problems arising from the use of the open-source Llama3-13B-lingyang-v1 model, including but not limited to data security issues, public opinion risks, or any risks and problems arising from the model being misled, abused, disseminated, or improperly used.