Text Generation
Safetensors
English
llama
samsja commited on
Commit
d139ddf
·
verified ·
1 Parent(s): 47f9f0f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +15 -4
README.md CHANGED
@@ -6,6 +6,17 @@ datasets:
6
  - PrimeIntellect/StackV1-popular
7
  - mlfoundations/dclm-baseline-1.0-parquet
8
  - open-web-math/open-web-math
 
 
 
 
 
 
 
 
 
 
 
9
  language:
10
  - en
11
  pipeline_tag: text-generation
@@ -96,13 +107,13 @@ First, we conducted an extensive series of 16 Supervised Fine-Tuning (SFT) train
96
  - arcee-ai/The-Tomb
97
 
98
  2. **Instruction Following**:
99
- - [mlabonne/open-perfectblend-fixed](MaziyarPanahi/open-perfectblend-fixed) (generalist capabilities)
100
- - [microsoft/orca-agentinstruct-1M-v1-cleaned](mlabonne/orca-agentinstruct-1M-v1-cleaned) (Chain-of-Thought)
101
- - [Post-training-Data-Flywheel/AutoIF-instruct-61k-with-funcs](Post-training-Data-Flywheel/AutoIF-instruct-61k-with-funcs)
102
 
103
  3. **Domain-Specific**:
104
  - [Team-ACE/ToolACE](https://huggingface.co/datasets/Team-ACE/ToolACE) (function calling)
105
- - [Synthia coder](MaziyarPanahi/Synthia-Coder-v1.5-I-sharegpt) (programming)
106
  - [ServiceNow-AI/M2Lingual](https://huggingface.co/datasets/ServiceNow-AI/M2Lingual) (multilingual)
107
  - [AI-MO/NuminaMath-TIR](https://huggingface.co/datasets/AI-MO/NuminaMath-TIR) (mathematics)
108
 
 
6
  - PrimeIntellect/StackV1-popular
7
  - mlfoundations/dclm-baseline-1.0-parquet
8
  - open-web-math/open-web-math
9
+ - MaziyarPanahi/open-perfectblend-fixed
10
+ - mlabonne/orca-agentinstruct-1M-v1-cleaned
11
+ - Post-training-Data-Flywheel/AutoIF-instruct-61k
12
+ - Team-ACE/ToolACE
13
+ - MaziyarPanahi/Synthia-Coder-v1.5-I-sharegpt
14
+ - ServiceNow-AI/M2Lingual
15
+ - AI-MO/NuminaMath-TIR
16
+ - allenai/tulu-3-sft-personas-code
17
+ - tulu-3-sft-personas-math
18
+ - tulu-3-sft-personas-math-grade
19
+ - tulu-3-sft-personas-algebra
20
  language:
21
  - en
22
  pipeline_tag: text-generation
 
107
  - arcee-ai/The-Tomb
108
 
109
  2. **Instruction Following**:
110
+ - [mlabonne/open-perfectblend-fixed](https://huggingface.co/datasets/MaziyarPanahi/open-perfectblend-fixed) (generalist capabilities)
111
+ - [microsoft/orca-agentinstruct-1M-v1-cleaned](https://huggingface.co/datasets/mlabonne/orca-agentinstruct-1M-v1-cleaned) (Chain-of-Thought)
112
+ - [Post-training-Data-Flywheel/AutoIF-instruct-61k-with-funcs](https://huggingface.co/datasets/Post-training-Data-Flywheel/AutoIF-instruct-61k)
113
 
114
  3. **Domain-Specific**:
115
  - [Team-ACE/ToolACE](https://huggingface.co/datasets/Team-ACE/ToolACE) (function calling)
116
+ - [Synthia coder](https://huggingface.co/datasets/MaziyarPanahi/Synthia-Coder-v1.5-I-sharegpt) (programming)
117
  - [ServiceNow-AI/M2Lingual](https://huggingface.co/datasets/ServiceNow-AI/M2Lingual) (multilingual)
118
  - [AI-MO/NuminaMath-TIR](https://huggingface.co/datasets/AI-MO/NuminaMath-TIR) (mathematics)
119