Update README.md
Browse files
README.md
CHANGED
@@ -6,6 +6,17 @@ datasets:
|
|
6 |
- PrimeIntellect/StackV1-popular
|
7 |
- mlfoundations/dclm-baseline-1.0-parquet
|
8 |
- open-web-math/open-web-math
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
9 |
language:
|
10 |
- en
|
11 |
pipeline_tag: text-generation
|
@@ -96,13 +107,13 @@ First, we conducted an extensive series of 16 Supervised Fine-Tuning (SFT) train
|
|
96 |
- arcee-ai/The-Tomb
|
97 |
|
98 |
2. **Instruction Following**:
|
99 |
-
- [mlabonne/open-perfectblend-fixed](MaziyarPanahi/open-perfectblend-fixed) (generalist capabilities)
|
100 |
-
- [microsoft/orca-agentinstruct-1M-v1-cleaned](mlabonne/orca-agentinstruct-1M-v1-cleaned) (Chain-of-Thought)
|
101 |
-
- [Post-training-Data-Flywheel/AutoIF-instruct-61k-with-funcs](Post-training-Data-Flywheel/AutoIF-instruct-61k
|
102 |
|
103 |
3. **Domain-Specific**:
|
104 |
- [Team-ACE/ToolACE](https://huggingface.co/datasets/Team-ACE/ToolACE) (function calling)
|
105 |
-
- [Synthia coder](MaziyarPanahi/Synthia-Coder-v1.5-I-sharegpt) (programming)
|
106 |
- [ServiceNow-AI/M2Lingual](https://huggingface.co/datasets/ServiceNow-AI/M2Lingual) (multilingual)
|
107 |
- [AI-MO/NuminaMath-TIR](https://huggingface.co/datasets/AI-MO/NuminaMath-TIR) (mathematics)
|
108 |
|
|
|
6 |
- PrimeIntellect/StackV1-popular
|
7 |
- mlfoundations/dclm-baseline-1.0-parquet
|
8 |
- open-web-math/open-web-math
|
9 |
+
- MaziyarPanahi/open-perfectblend-fixed
|
10 |
+
- mlabonne/orca-agentinstruct-1M-v1-cleaned
|
11 |
+
- Post-training-Data-Flywheel/AutoIF-instruct-61k
|
12 |
+
- Team-ACE/ToolACE
|
13 |
+
- MaziyarPanahi/Synthia-Coder-v1.5-I-sharegpt
|
14 |
+
- ServiceNow-AI/M2Lingual
|
15 |
+
- AI-MO/NuminaMath-TIR
|
16 |
+
- allenai/tulu-3-sft-personas-code
|
17 |
+
- tulu-3-sft-personas-math
|
18 |
+
- tulu-3-sft-personas-math-grade
|
19 |
+
- tulu-3-sft-personas-algebra
|
20 |
language:
|
21 |
- en
|
22 |
pipeline_tag: text-generation
|
|
|
107 |
- arcee-ai/The-Tomb
|
108 |
|
109 |
2. **Instruction Following**:
|
110 |
+
- [mlabonne/open-perfectblend-fixed](https://huggingface.co/datasets/MaziyarPanahi/open-perfectblend-fixed) (generalist capabilities)
|
111 |
+
- [microsoft/orca-agentinstruct-1M-v1-cleaned](https://huggingface.co/datasets/mlabonne/orca-agentinstruct-1M-v1-cleaned) (Chain-of-Thought)
|
112 |
+
- [Post-training-Data-Flywheel/AutoIF-instruct-61k-with-funcs](https://huggingface.co/datasets/Post-training-Data-Flywheel/AutoIF-instruct-61k)
|
113 |
|
114 |
3. **Domain-Specific**:
|
115 |
- [Team-ACE/ToolACE](https://huggingface.co/datasets/Team-ACE/ToolACE) (function calling)
|
116 |
+
- [Synthia coder](https://huggingface.co/datasets/MaziyarPanahi/Synthia-Coder-v1.5-I-sharegpt) (programming)
|
117 |
- [ServiceNow-AI/M2Lingual](https://huggingface.co/datasets/ServiceNow-AI/M2Lingual) (multilingual)
|
118 |
- [AI-MO/NuminaMath-TIR](https://huggingface.co/datasets/AI-MO/NuminaMath-TIR) (mathematics)
|
119 |
|