Update README.md
README.md
CHANGED
@@ -10,6 +10,7 @@ datasets:
 - silk-road/alpaca-data-gpt4-chinese
 - databricks/databricks-dolly-15k
 - microsoft/orca-math-word-problems-200k
+- Sao10K/Claude-3-Opus-Instruct-5K
 language:
 - zh
 - en
@@ -18,23 +19,27 @@ language:
 
 ### Datasets
 The following datasets are used
-![image/png](https://cdn-uploads.huggingface.co/production/uploads/636f54b95d2050767e4a6317/
+![image/png](https://cdn-uploads.huggingface.co/production/uploads/636f54b95d2050767e4a6317/maYhXsWKddThOBRvU5HtZ.png)
 Qwen1.5-7B-Chat was fine-tuned; test results show that both the CEVAL and MMLU scores increased
 
 ### Model:
 - https://huggingface.co/Qwen/Qwen1.5-7B-Chat
 
 ### Datasets:
+- https://huggingface.co/datasets/REILX/extracted_tagengo_gpt4
 - https://huggingface.co/datasets/TigerResearch/sft_zh
 - https://huggingface.co/datasets/silk-road/alpaca-data-gpt4-chinese
 - https://huggingface.co/datasets/LooksJuicy/ruozhiba
+- https://huggingface.co/datasets/microsoft/orca-math-word-problems-200k
+- https://huggingface.co/datasets/alexl83/AlpacaDataCleaned
+- https://huggingface.co/datasets/Sao10K/Claude-3-Opus-Instruct-5K
 
 
 ### Results
 | Model name | CEVAL | MMLU |
 |------------------------ |-------|------|
 | Qwen1.5-7B-Chat | 68.61 | 61.56 |
-| Qwen1.5-7B-Chat-sft-lora-tigerbot-alpacadatagpt4-ruozhiba-1epoch | 71.
+| Qwen1.5-7B-Chat-sft-lora-tigerbot-alpacadatagpt4-ruozhiba-1epoch | 71.36 | |
 
 ### License
-This project utilizes certain datasets and checkpoints that are subject to their respective original licenses. Users must comply with all terms and conditions of these original licenses. The content of this project itself is licensed under the Apache license 2.0.
+This project utilizes certain datasets and checkpoints that are subject to their respective original licenses. Users must comply with all terms and conditions of these original licenses. The content of this project itself is licensed under the Apache license 2.0.
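
As a quick check on the results table in this README, the CEVAL gain of the fine-tuned model over the base model can be computed directly from the two reported scores. The sketch below is illustrative only: the `SCORES` dictionary and the `score_delta` helper are hypothetical names, not part of the repository, and the fine-tuned model's MMLU cell is treated as missing because the table leaves it blank.

```python
from typing import Optional

BASE = "Qwen1.5-7B-Chat"
TUNED = "Qwen1.5-7B-Chat-sft-lora-tigerbot-alpacadatagpt4-ruozhiba-1epoch"

# Scores copied from the results table; None marks the blank MMLU cell.
SCORES: dict[str, dict[str, Optional[float]]] = {
    BASE: {"CEVAL": 68.61, "MMLU": 61.56},
    TUNED: {"CEVAL": 71.36, "MMLU": None},
}


def score_delta(base: str, tuned: str, benchmark: str) -> Optional[float]:
    """Return tuned minus base for one benchmark, or None if either score is missing."""
    base_score = SCORES[base][benchmark]
    tuned_score = SCORES[tuned][benchmark]
    if base_score is None or tuned_score is None:
        return None
    # Round to two decimals to match the precision reported in the table.
    return round(tuned_score - base_score, 2)


if __name__ == "__main__":
    for benchmark in ("CEVAL", "MMLU"):
        delta = score_delta(BASE, TUNED, benchmark)
        label = "not reported" if delta is None else f"{delta:+.2f}"
        print(f"{benchmark}: {label}")
```

Returning `None` for a missing score keeps the blank MMLU cell from being read as zero, so only the CEVAL difference is reported as a number.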