PCL-Reasoner committed on
Commit
2d276c8
·
verified ·
1 Parent(s): 35ca8f2

Upload folder using huggingface_hub

This view is limited to 50 files because it contains too many changes.
Files changed (50)
  1. .gitattributes +3 -0
  2. .mdl +0 -0
  3. .msc +0 -0
  4. .mv +1 -0
  5. README.md +441 -3
  6. configuration.json +1 -0
  7. eval_results/aime24_eval_res.jsonl +3 -0
  8. eval_results/aime25_eval_res.jsonl +3 -0
  9. images/README/eval_results.png +3 -0
  10. safetensors/config.json +28 -0
  11. safetensors/configuration.json +1 -0
  12. safetensors/generation_config.json +6 -0
  13. safetensors/model-00001-of-00064.safetensors +3 -0
  14. safetensors/model-00002-of-00064.safetensors +3 -0
  15. safetensors/model-00003-of-00064.safetensors +3 -0
  16. safetensors/model-00004-of-00064.safetensors +3 -0
  17. safetensors/model-00005-of-00064.safetensors +3 -0
  18. safetensors/model-00006-of-00064.safetensors +3 -0
  19. safetensors/model-00007-of-00064.safetensors +3 -0
  20. safetensors/model-00008-of-00064.safetensors +3 -0
  21. safetensors/model-00009-of-00064.safetensors +3 -0
  22. safetensors/model-00010-of-00064.safetensors +3 -0
  23. safetensors/model-00011-of-00064.safetensors +3 -0
  24. safetensors/model-00012-of-00064.safetensors +3 -0
  25. safetensors/model-00013-of-00064.safetensors +3 -0
  26. safetensors/model-00014-of-00064.safetensors +3 -0
  27. safetensors/model-00015-of-00064.safetensors +3 -0
  28. safetensors/model-00016-of-00064.safetensors +3 -0
  29. safetensors/model-00017-of-00064.safetensors +3 -0
  30. safetensors/model-00018-of-00064.safetensors +3 -0
  31. safetensors/model-00019-of-00064.safetensors +3 -0
  32. safetensors/model-00020-of-00064.safetensors +3 -0
  33. safetensors/model-00021-of-00064.safetensors +3 -0
  34. safetensors/model-00022-of-00064.safetensors +3 -0
  35. safetensors/model-00023-of-00064.safetensors +3 -0
  36. safetensors/model-00024-of-00064.safetensors +3 -0
  37. safetensors/model-00025-of-00064.safetensors +3 -0
  38. safetensors/model-00026-of-00064.safetensors +3 -0
  39. safetensors/model-00027-of-00064.safetensors +3 -0
  40. safetensors/model-00028-of-00064.safetensors +3 -0
  41. safetensors/model-00029-of-00064.safetensors +3 -0
  42. safetensors/model-00030-of-00064.safetensors +3 -0
  43. safetensors/model-00031-of-00064.safetensors +3 -0
  44. safetensors/model-00032-of-00064.safetensors +3 -0
  45. safetensors/model-00033-of-00064.safetensors +3 -0
  46. safetensors/model-00034-of-00064.safetensors +3 -0
  47. safetensors/model-00035-of-00064.safetensors +3 -0
  48. safetensors/model-00036-of-00064.safetensors +3 -0
  49. safetensors/model-00037-of-00064.safetensors +3 -0
  50. safetensors/model-00038-of-00064.safetensors +3 -0
.gitattributes CHANGED
@@ -33,3 +33,6 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ eval_results/aime24_eval_res.jsonl filter=lfs diff=lfs merge=lfs -text
37
+ eval_results/aime25_eval_res.jsonl filter=lfs diff=lfs merge=lfs -text
38
+ images/README/eval_results.png filter=lfs diff=lfs merge=lfs -text
.mdl ADDED
Binary file (50 Bytes). View file
 
.msc ADDED
Binary file (7.41 kB). View file
 
.mv ADDED
@@ -0,0 +1 @@
 
 
1
+ Revision:master,CreatedAt:1753085179
README.md CHANGED
@@ -1,3 +1,441 @@
1
- ---
2
- license: apache-2.0
3
- ---
1
+ <style>
2
+ /* Global font and spacing */
3
+ body {
4
+ font-family: 'Segoe UI', sans-serif;
5
+ line-height: 1.75; /* 1.75x line spacing */
6
+ color: #333;
7
+ margin: 0 auto; /* center the content */
8
+ }
9
+
10
+ /* Paragraph spacing */
11
+ p, ul, ol, blockquote {
12
+ margin-bottom: 1.25em; /* spacing between paragraphs, lists, and blockquotes */
13
+ }
14
+
15
+
16
+ /* Code blocks */
17
+ pre {
18
+ background-color:rgb(146, 150, 153);
19
+ border-radius: 6px;
20
+ padding: 16px;
21
+ overflow: auto;
22
+ line-height: 1.45; /* line spacing inside code blocks */
23
+ }
24
+
25
+ /* Inline code */
26
+ code {
27
+ background: rgb(146, 150, 153);
28
+ padding: 2px 6px;
29
+ border-radius: 3px;
30
+ font-family: 'Fira Code', monospace;
31
+ }
32
+ </style>
33
+
34
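+ # **PCL-Reasoner-V1 Model**
35
+
36
+ ## Model Overview
37
+
38
+ PCL-Reasoner-V1 starts from Qwen2.5-32B-Base and is trained with high-performance supervised fine-tuning on the MindSpore framework and Ascend hardware. Fine-tuning markedly improves the model's mathematical reasoning: it reaches 85.7% accuracy on the AIME24 benchmark and 84.2% on AIME25, placing it among the leading models at the 32B parameter scale.
39
+
40
+ To encourage sharing and reuse, we have open-sourced the PCL-Reasoner-V1 model weights, fine-tuning data, and training code in full. Besides being one of the leading 32B mathematical reasoning models currently available, it gives developers a worked example of domain-specific supervised fine-tuning and a post-training recipe. The tutorial below covers deployment and the post-training workflow.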
41
+
42
+ ![eval_results](images/README/eval_results.png)
43
+
44
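+ ## Development Guide
45
+
46
+ ### 1. Model Files
47
+
48
+ PCL-Reasoner-V1 is obtained by post-training (fine-tuning) Qwen2.5-32B-Base. The training pipeline is implemented with MindFormers and mainly involves the following files:
49
+
50
+ Data processing: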
51
+
52
+ ```
53
+ pcl_reasoner_v1
54
+ ├── qwen2_5_tokenizer.py # qwen2_5 tokenizer
55
+ ├── packing_handler.py # data packing handler
56
+ └── data_preprocess
57
+ ├── decontaminate.py # data contamination detection
58
+ └── dataset_prehandle_and_split.py # dataset splitting and preprocessing
59
+ ```
60
+
61
+ Model configuration:
62
+
63
+ ```
64
+ pcl_reasoner_v1/config
65
+ ├── data_process_handling.yaml # data format conversion config
66
+ ├── data_process_packing.yaml # data concatenation (packing) config
67
+ └── finetune_pcl_reasoner_v1_32k.yaml # model fine-tuning config
68
+ ```
69
+
70
+ Task launch script:
71
+
72
+ ```
73
+ pcl_reasoner_v1
74
+ └── run_pcl_reasoner_v1_finetune.sh # fine-tuning launch script
75
+ ```
76
+
77
+
78
+ ### 2. Environment and Data Preparation
79
+
80
+ #### 2.1 Environment Setup
81
+
82
+ | Software | Version |
83
+ | --- | --- |
84
+ | Firmware & driver | 24.1.rc3.5 |
85
+ | CANN | 7.7.T9.0.B057:8.1.RC1 |
86
+ | Python | 3.10 |
87
+ | MindSpore | 2.6.0 |
88
+ | MindSpore Transformers | r1.5.0 |
89
+
90
+ #### 2.2 Data Processing
91
+
92
+ ##### 2.2.1 Dataset Download
93
+
94
+ The raw dataset can be downloaded from Hugging Face:
95
+
96
+ | Dataset | Link |
97
+ | ------------------------------- | ----------------------------------------------------------------------------------------------------------------------------- |
98
+ | AM-DeepSeek-R1-0528-Distilled | [https://huggingface.co/a-m-team/AM-DeepSeek-R1-0528-Distilled](https://huggingface.co/a-m-team/AM-DeepSeek-R1-0528-Distilled) |
99
+
100
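+ ##### 2.2.2 Data Preprocessing
101
+
102
+ First, we check and filter the source data in two steps: evaluation-set contamination detection and data filtering.
103
+
104
+ * Evaluation-set contamination detection: we embed the text with the all-MiniLM-L6-v2 model and compute cosine similarity to check the math portion of the raw data for contamination against the AIME24/25 evaluation sets. The script prints the detection results to the terminal and saves, under the specified output path, every question whose similarity exceeds the threshold together with the evaluation question it matched.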
105
+
106
+ ```
107
+ python PCL-Reasoner-V1/pcl_reasoner_v1/data_preprocess/decontaminate.py \
108
+ --target_data /path/to/target_data \
109
+ --contaminant_source PCL-Reasoner-V1/pcl_reasoner_v1/data_preprocess/aime2425_questions.json \
110
+ --model_path /path/to/distilled/model_path \
111
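+ --output_file_prefix /path/to/output_file_prefix \
112
+ --threshold 0.7
113
+
114
+ # Parameters
115
+ # target_data: the data to be checked
116
+ # contaminant_source: the contamination source, i.e. the evaluation-set data
117
+ # model_path: the model used to compute text embeddings
118
+ # output_file_prefix: output path for the detection results
119
+ # threshold: similarity threshold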
120
+ ```
121
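+ * Data filtering and processing: run the data processing script to filter by length, keeping samples whose question plus chain of thought is shorter than 32K tokens, and to add the prompt to each sample.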
122
+
123
+ ```
124
+ python PCL-Reasoner-V1/pcl_reasoner_v1/data_preprocess/convert_and_split_dataset.py \
125
+ --json_file_paths /path/to/AM-DeepSeek-R1-0528-Distilled/math.jsonl
126
+
127
+ # Parameters
128
+ # json_file_paths: the dataset files to process; multiple paths may be passed, separated by spaces
129
+ ```
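
The two checks described above can be illustrated with a minimal sketch. It is not the code in `decontaminate.py` or the splitting script. It assumes that embeddings have already been computed (for example by all-MiniLM-L6-v2) and are passed in as plain lists of floats. The field names `question` and `reasoning` and the `count_tokens` callable are hypothetical stand-ins for the real data schema and tokenizer.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def find_contaminated(train_vecs, eval_vecs, threshold=0.7):
    """Return (train_index, eval_index, score) for each training item whose
    best-matching evaluation item scores above the threshold."""
    hits = []
    for i, train_vec in enumerate(train_vecs):
        best_j, best_score = -1, -1.0
        for j, eval_vec in enumerate(eval_vecs):
            score = cosine_similarity(train_vec, eval_vec)
            if score > best_score:
                best_j, best_score = j, score
        if best_score > threshold:
            hits.append((i, best_j, best_score))
    return hits


def filter_by_length(samples, count_tokens, max_tokens=32 * 1024):
    """Keep samples whose question plus chain of thought is shorter than max_tokens.

    `question` and `reasoning` are hypothetical field names; `count_tokens`
    is any callable mapping a string to a token count.
    """
    kept = []
    for sample in samples:
        total = count_tokens(sample["question"]) + count_tokens(sample["reasoning"])
        if total < max_tokens:
            kept.append(sample)
    return kept
```

In practice `count_tokens` would wrap the Qwen2.5 tokenizer. A whitespace split is enough to try the function on toy data.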
130
+
131
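+ Next, we convert the data into packing format in two steps: format conversion and data concatenation.
132
+
133
+ * Format conversion: in the configuration file `pcl_reasoner_v1/config/data_process_handling.yaml`, set file paths such as `data_files`, `vocab_file`, and `merges_file`, and set the data handler to the custom `AMDeepSeekDataHandler` defined in `pcl_reasoner_v1/packing_handler.py`: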
134
+
135
+ ```
136
+ train_dataset:
137
+ ...
138
+ path: "json" # file format of the raw dataset
139
+ data_files:
140
+ ["/path/to/data.jsonl"] # path to the raw dataset
141
+ input_columns: *input_columns
142
+ handler:
143
+ - type: AMDeepSeekDataHandler # the custom data handler class
144
+ ...
145
+ tokenizer:
146
+ auto_register: qwen2_5_tokenizer.Qwen2Tokenizer
147
+ ...
148
+ vocab_file: "/path/to/vocab.json" # default Qwen2_5 tokenizer file
149
+ merges_file: "/path/to/merges.txt" # default Qwen2_5 tokenizer file
150
+ ...
151
+ ```
152
+
153
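+ *(Note: the configuration above is an example that lists only the most frequently modified options; see the code repository for the complete configuration file.)*
154
+
155
+ Run the data processing script to generate data files in Arrow format: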
156
+
157
+ ```
158
+ export PYTHONPATH=/path/to/mindformers/:$PYTHONPATH
158
+ python /path/to/mindformers/toolkit/data_preprocess/huggingface/datasets_preprocess.py \
159
+ --config ./pcl_reasoner_v1/config/data_process_handling.yaml \
160
+ --save_path /path/to/handled_data/ \
161
+ --register_path ./pcl_reasoner_v1/
162
+
163
+ # Parameters
164
+ # config: path to the data format conversion configuration file
165
+ # save_path: directory in which the converted dataset is saved
166
+ # register_path: directory from which the custom data handler is registered
168
+ ```
169
+ * Data concatenation:
170
+
171
+ In the configuration file `pcl_reasoner_v1/config/data_process_packing.yaml`, set the path to the preprocessed dataset that will be packed:
172
+
173
+ ```
174
+ # dataset
175
+ train_dataset:
176
+ data_loader:
177
+ ...
178
+ path: /path/to/handled_data # path to the preprocessed dataset
179
+ ...
180
+ ```
181
+
182
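+ *(Note: the configuration above is an example that lists only the most frequently modified options; see the code repository for the complete configuration file.)*
183
+
184
+ Run the data packing script to generate the packed data files: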
185
+
186
+ ```
187
+ export PYTHONPATH=/path/to/mindformers/:$PYTHONPATH
187
+ python /path/to/mindformers/toolkit/data_preprocess/huggingface/datasets_preprocess.py \
188
+ --config ./pcl_reasoner_v1/config/data_process_packing.yaml \
189
+ --save_path /path/to/packed_data/ \
190
+ --register_path ./pcl_reasoner_v1/
191
+
192
+ # Parameters
193
+ # config: path to the data concatenation configuration file
194
+ # save_path: directory in which the concatenated dataset is saved
195
+ # register_path: directory from which the custom data handler is registered
197
+ ```
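
To make the idea of concatenating samples into fixed-length sequences concrete, here is a minimal sketch of first-fit packing. It is an illustration only: the source does not say which packing strategy MindFormers uses, so first-fit is an assumption, and the function name `pack_sequences` is hypothetical. Each bin stands for one packed training sequence of at most `seq_length` tokens.

```python
def pack_sequences(lengths, seq_length=32768):
    """Group sample indices into bins whose total token count fits in seq_length.

    Uses first-fit: each sample goes into the first bin with enough room,
    or opens a new bin if none has room.
    """
    bins = []  # each entry is [remaining_capacity, [sample indices]]
    for index, length in enumerate(lengths):
        if length > seq_length:
            raise ValueError(f"sample {index} has {length} tokens, more than {seq_length}")
        for entry in bins:
            if entry[0] >= length:
                entry[0] -= length
                entry[1].append(index)
                break
        else:
            # No existing bin had room, so start a new packed sequence.
            bins.append([seq_length - length, [index]])
    return [entry[1] for entry in bins]
```

Because samples longer than the target length cannot be packed, the earlier length filter (under 32K tokens) is what lets every remaining sample fit.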
198
+
199
+
200
+ ### 3. Training Workflow
200
+ #### 3.1 Weight Preparation
201
+
202
+ The pretrained weights can be downloaded from Hugging Face:
203
+
204
+ | Model | Weights |
206
+ | ------------------- | --------------------------------------------------------------------------------- |
207
+ | Qwen2.5-32B-Base | [https://huggingface.co/Qwen/Qwen2.5-32B](https://huggingface.co/Qwen/Qwen2.5-32B) |
208
+
209
+ MindFormers 1.5.0 and later can load and save weights in safetensors format directly, with no conversion to ckpt required. The fine-tuning below runs with safetensors weights.
210
+
211
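+ #### 3.2 Training Configuration
211
+
212
+ Only the most frequently modified options are listed below; the complete configuration is in `pcl_reasoner_v1/config/finetune_pcl_reasoner_v1_32k.yaml`.
213
+
214
+ Basic configuration: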
216
+
217
+ ```
218
+ run_mode: 'finetune' # set the run mode to "finetune"
218
+ load_checkpoint: '/path/to/Qwen-32B-base/' # path to the weight files
219
+ load_ckpt_format: 'safetensors' # set the weight format to "safetensors"
220
+ auto_trans_ckpt: True # shard the weights online into distributed weights
222
+ ```
223
+
224
+ Dataset configuration:
225
+
226
+ ```
227
+ train_dataset: &train_dataset
228
+
229
+   data_loader:
230
+     type: CommonDataLoader
231
+
232
+     # offline
233
+     path: "/path/to/dataset/pack_data_lt_32K_full" # path to the data files
234
+     load_func: 'load_from_disk' # load the data with "load_from_disk"
235
+
236
+     shuffle: True # enable data shuffling
237
+     packing: pack # data is in packed format
238
+     adaptor_config:
239
+       compress_mask: True
240
+     mock_config:
241
+       seq_length: 32768 # packed sequence length is 32K
242
+       size: 25909 # dataset size / data-parallel split
243
+ ```
244
+
245
+ Parallelism configuration:
246
+
247
+ ```
248
+ parallel_config:
249
+   data_parallel: &dp 8 # data parallelism degree: 8
250
+   model_parallel: 8 # model parallelism degree: 8
251
+   pipeline_stage: 2 # pipeline parallelism: 2 stages
252
+   use_seq_parallel: True # enable sequence parallelism
253
+   optimizer_shard: True # enable optimizer sharding
254
+   micro_batch_num: 16 # number of micro-batches set to 16
255
+ ```
256
+
257
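+ > *(Note: the configuration above is an example that lists only the most frequently modified options; see the code repository for the complete configuration file.)*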
258
+
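
The parallelism degrees above have to match the cluster size used at launch. The sketch below checks that arithmetic for the values in this README (8 x 8 x 2 = 128 devices, 8 per node). The helper name `check_parallel_layout` is hypothetical. The rule that the product of the data, model, and pipeline degrees equals the total device count is the usual convention for this kind of 3D parallelism, and it is stated here as an assumption rather than quoted from the MindFormers source.

```python
def check_parallel_layout(data_parallel, model_parallel, pipeline_stage,
                          worker_num, local_worker_num):
    """Validate that the parallel degrees cover exactly worker_num devices
    and return the number of nodes needed."""
    required = data_parallel * model_parallel * pipeline_stage
    if required != worker_num:
        raise ValueError(
            f"parallel degrees need {required} devices, but worker_num is {worker_num}"
        )
    if worker_num % local_worker_num != 0:
        raise ValueError("worker_num must be a multiple of local_worker_num")
    return worker_num // local_worker_num


# Values taken from the configuration and launch script in this README.
nodes = check_parallel_layout(
    data_parallel=8, model_parallel=8, pipeline_stage=2,
    worker_num=128, local_worker_num=8,
)
print(nodes)
```

With these values the launch script would be run once per node, with `node_rank` ranging over the node indices.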
259
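+ #### 3.3 Launching Fine-Tuning
260
+
261
+ In the launch script `run_pcl_reasoner_v1_finetune.sh`, specify the configuration file `pcl_reasoner_v1/config/finetune_pcl_reasoner_v1_32k.yaml`, and adjust settings such as the number of devices and the server IP to match your environment: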
262
+
263
+ ```
264
+ noderank=$1
265
+
266
+ bash /path/to/mindformers/scripts/msrun_launcher.sh "run_mindformer.py \
267
+ --config /path/to/finetune_pcl_reasoner_v1_32k.yaml \
268
+ --run_mode finetune" \
269
+ --worker_num 128 \
270
+ --local_worker_num 8 \
271
+ --master_addr XX.XX.XX.XX \
272
+ --master_port XXXX \
273
+ --node_rank $noderank \
274
+ --log_dir /path/to/log \
275
+ --join False \
276
+ --cluster_time_out 1200 \
277
+ > run.log 2>&1
278
+
279
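+ # Parameters
280
+ # config: path to the configuration file
281
+ # run_mode: run mode (pretraining / fine-tuning / inference)
282
+ # worker_num: total number of devices
283
+ # local_worker_num: number of devices per node
284
+ # master_addr: address of the master node
285
+ # master_port: port of the master node
286
+ # log_dir: log directory
287
+ # join: whether to wait for all workers to exit
288
+ # cluster_time_out: cluster wait timeout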
289
+ ```
290
+
291
+ Then start fine-tuning with `bash run_pcl_reasoner_v1_finetune.sh`. When launching on multiple nodes, `node_rank` must be specified (the command below uses node 0 as an example):
292
+
293
+ ```
294
+ bash run_pcl_reasoner_v1_finetune.sh 0
295
+ ```
296
+
297
+ After the job has started, view the run log with the following command:
298
+
299
+ ```
300
+ tail -f /path/to/log/worker_127.log
301
+ ```
302
+
303
+
304
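+ ### 4. Evaluation Workflow
305
+
306
+ To keep the evaluation fair, we use the open-source evaluation code from QwQ (QwQ/eval at main · QwenLM/QwQ). Follow the README.md in that repository to set up the environment and evaluate the model.
307
+
308
+ The evaluation hyperparameters we used are as follows:
309
+
310
+ | Sampling hyperparameter | Value |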
311
+ | ---------------- | --------------------------------------------- |
312
+ | temperature | 0.6 |
313
+ | top\_k | 40 |
314
+ | top\_p | 0.95 |
315
+ | max\_tokens | 129024 |
316
+ | chat\_template |`./pcl_reasoner_v1/eval/am_thinking.jinja` |
317
+
318
+ The AIME24/AIME25 results are listed in the table below. To make the evaluation reliable, we report Avg@32 (the average over 32 samples):
319
+
320
+
321
+ <!-- Basic table styles (optional) -->
322
+
323
+ <style>
324
+ table { border-collapse: collapse; width: 100%; margin-left: auto;margin-right: auto;}
325
+ th, td { border: 1px solid #ddd; padding: 8px; text-align: center; }
326
+ </style>
327
+
328
+ <!-- Table body -->
329
+
330
+ <table>
331
+ <tr>
332
+ <th>Parameters</th>
333
+ <th>Model</th>
334
+ <th>AIME 24</th>
335
+ <th>AIME 25</th>
336
+ </tr>
337
+ <!-- Row-spanning header: >100B -->
338
+ <tr>
339
+ <th rowspan="6">&gt;100B</th>
340
+ </tr>
341
+ <!-- Data rows for the >100B group -->
342
+ <tr>
343
+ <td>DeepSeek-R1</td>
344
+ <td><span style="color:grey">79.8</span></td>
345
+ <td><span style="color:grey">70</span></td>
346
+ </tr>
347
+ <tr>
348
+ <td>DeepSeek-R1-0528</td>
349
+ <td><span style="color:red">91.4</span></td>
350
+ <td><span style="color:red">87.5</span></td>
351
+ </tr>
352
+ <tr>
353
+ <td>Qwen3-235B-A22B</td>
354
+ <td><span style="color:grey">85.7</span></td>
355
+ <td><span style="color:grey">81.5</span></td>
356
+ </tr>
357
+ <tr>
358
+ <td>OpenAI-o3</td>
359
+ <td><b>91.6</b></td>
360
+ <td><b>88.9</b></td>
361
+ </tr>
362
+ <tr>
363
+ <td>Gemini-2.5-Pro-0506</td>
364
+ <td><span style="color:red">90.8</span></td>
365
+ <td><span style="color:grey">83</span></td>
366
+ </tr>
367
+ <!-- Separator row -->
368
+ <tr>
369
+ <td colspan="4" style="background-color: #f8f8f8;"></td>
370
+ </tr>
371
+ <!-- Row-spanning header: 32B -->
372
+ <tr>
373
+ <th rowspan="7">32B</th>
374
+ </tr>
375
+ <!-- Data rows for the 32B group -->
376
+ <tr>
377
+ <td>Qwen3-32B</td>
378
+ <td><span style="color:grey">81.4</span></td>
379
+ <td><span style="color:grey">72.9</span></td>
380
+ </tr>
381
+ <tr>
382
+ <td>QwQ-32B</td>
383
+ <td><span style="color:grey">79.5</span></td>
384
+ <td><span style="color:grey">69.5</span></td>
385
+ </tr>
386
+ <tr>
387
+ <td>DeepSeek-R1-Distill-Qwen-32B</td>
388
+ <td><span style="color:grey">72.6</span></td>
389
+ <td><span style="color:grey">49.6</span></td>
390
+ </tr>
391
+ <tr>
392
+ <td>Skywork-OR1-32B</td>
393
+ <td><span style="color:grey">82.2</span></td>
394
+ <td><span style="color:grey">73.3</span></td>
395
+ </tr>
396
+ <tr>
397
+ <td>AM-Thinking-v1</td>
398
+ <td><span style="color:grey">85.3</span></td>
399
+ <td><span style="color:grey">74.4</span></td>
400
+ </tr>
401
+ <tr>
402
+ <td>PCL-Reasoner-v1</td>
403
+ <td><b>85.7</b></td>
404
+ <td><b>84.2</b></td>
405
+ </tr>
406
+ </table>
407
+
408
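+ > *(Note: the model's generated outputs on the AIME24/25 evaluation sets have been uploaded to the `pcl_reasoner_v1/eval/eval_res` directory so that developers can verify the model and compare results.)*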
409
+
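
As a rough illustration of the Avg@32 metric used above, the sketch below averages per-problem accuracy over repeated samples. It is a sketch under stated assumptions, not the QwQ evaluation code: the function name `avg_at_k` and the input layout (one list of correctness flags per problem) are hypothetical.

```python
def avg_at_k(correct_by_problem):
    """Mean accuracy in percent, averaged first over the samples of each problem
    and then over problems.

    correct_by_problem: one list of booleans per problem, one flag per sampled answer
    (32 flags per problem for Avg@32).
    """
    if not correct_by_problem:
        raise ValueError("need at least one problem")
    per_problem = [sum(flags) / len(flags) for flags in correct_by_problem]
    return 100.0 * sum(per_problem) / len(per_problem)
```

Averaging over many samples reduces the variance that a single sampled answer per problem would carry on a 30-question benchmark.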
410
+
411
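+ We also measured accuracy at different maximum response lengths. The AIME24/25 evaluation sets demand long responses: on the comparatively easier AIME24, a 64K-token response length is sufficient, whereas the harder AIME25 needs responses of up to 128K tokens: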
412
+
413
+ <style>
414
+ table { border-collapse: collapse; width: 100%; margin-left: auto;margin-right: auto;}
415
+ th, td { border: 1px solid #ddd; padding: 8px; text-align: center; }
416
+ </style>
417
+
418
+ <table>
419
+ <tr>
420
+ <th>Response length</th>
421
+ <th>16K</th>
422
+ <th>32K</th>
423
+ <th>64K</th>
424
+ <th>128K</th>
425
+ </tr>
426
+ <tr>
427
+ <td>AIME24</td>
428
+ <td>42.0</td>
429
+ <td>77.9</td>
430
+ <td>85.7</td>
431
+ <td>85.7</td>
432
+ </tr>
433
+ <tr>
434
+ <td>AIME25</td>
435
+ <td>33.4</td>
436
+ <td>75.6</td>
437
+ <td>83.9</td>
438
+ <td>84.2</td>
439
+ </tr>
440
+ </table>
441
+
configuration.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"framework":"other","task":"nli"}
eval_results/aime24_eval_res.jsonl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9b197b5c64a57be8316262ea2ad122598e81e601644185ae1f82cb95cc845d43
3
+ size 55231460
eval_results/aime25_eval_res.jsonl ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:51ed6f0ea7568de1a4edd49109df6967b8c7cc24775cd2dbbe8c6c7af6a01fc2
3
+ size 62529130
images/README/eval_results.png ADDED

Git LFS Details

  • SHA256: 9124b6dad310c6dfecc7237e03200a2a8f80d0867d07db1137dd5b10ac44c9ef
  • Pointer size: 131 Bytes
  • Size of remote file: 146 kB
safetensors/config.json ADDED
@@ -0,0 +1,28 @@
1
+ {
2
+ "architectures": [
3
+ "Qwen2ForCausalLM"
4
+ ],
5
+ "attention_dropout": 0.0,
6
+ "bos_token_id": 151643,
7
+ "eos_token_id": 151643,
8
+ "hidden_act": "silu",
9
+ "hidden_size": 5120,
10
+ "initializer_range": 0.02,
11
+ "intermediate_size": 27648,
12
+ "max_position_embeddings": 131072,
13
+ "max_window_layers": 64,
14
+ "model_type": "qwen2",
15
+ "num_attention_heads": 40,
16
+ "num_hidden_layers": 64,
17
+ "num_key_value_heads": 8,
18
+ "rms_norm_eps": 1e-05,
19
+ "rope_scaling": null,
20
+ "rope_theta": 1000000.0,
21
+ "sliding_window": 131072,
22
+ "tie_word_embeddings": false,
23
+ "torch_dtype": "bfloat16",
24
+ "transformers_version": "4.52.1",
25
+ "use_cache": true,
26
+ "use_sliding_window": false,
27
+ "vocab_size": 152064
28
+ }
safetensors/configuration.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {}
safetensors/generation_config.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "_from_model_config": true,
3
+ "bos_token_id": 151643,
4
+ "eos_token_id": 151643,
5
+ "transformers_version": "4.52.1"
6
+ }
safetensors/model-00001-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c515f7783cabf67c4b703b2ab409e57b036c9378e0fad540ef30fbbe00858efb
3
+ size 2532347336
safetensors/model-00002-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a4dd544510c6be7267bede33d0c4c58f18edf1aafe15e92a8d133e43508d25ef
3
+ size 975211840
safetensors/model-00003-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b800b350d6c7b8b345c5947d4e2a6ce9381c4316d058a9c38d2e748bd66abb6a
3
+ size 975211840
safetensors/model-00004-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:31d530e16424c0fc5180220e2b36bd13d8efbedc4e6a9eeae15b124d1baf68c3
3
+ size 975211840
safetensors/model-00005-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3c5d746da9ef5a7a1f2564ff054d6d3d27053a7d5a56760994a8bf8abd4324df
3
+ size 975211840
safetensors/model-00006-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:175d3a92049e073c57fab61d945d40bfba29c9f86cf25a16369fa12a2a8eed6f
3
+ size 975211840
safetensors/model-00007-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4cedeaf5800f84de2ca216b98d9360ef2416b3133902f0d272bf21f35461108c
3
+ size 975211840
safetensors/model-00008-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b7811a287b6cde0067a862c7e449e75c11afd3a7d29c8e4ff368efc35f9b3316
3
+ size 975211840
safetensors/model-00009-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:60e77b27d07cf1201cca0ea49296106f4aaa0f558e2570b4139f1ae43a7aa7e6
3
+ size 975211840
safetensors/model-00010-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:dfd138423a75947e0a8ceb36162df07471dd789493af8a3807bca5f33e0ba5a7
3
+ size 975211840
safetensors/model-00011-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fac950d600f590b5e85d1147808a6d0280c0f01177749dc393757a8f8370498b
3
+ size 975211856
safetensors/model-00012-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bd10be26a80e8c54393a4ab8f1af0a9ec916dfbc274e1ddedf0832f5a34b8b43
3
+ size 975211856
safetensors/model-00013-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a3e0eca44e051a79d6ab37f034134b04f47b7075162ca2678f01e7d84348ca4d
3
+ size 975211856
safetensors/model-00014-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:516b825342814bb132b28f88e3e2b4585efc30c72fd3cab27d78839ff614f58d
3
+ size 975211856
safetensors/model-00015-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5b70782a8ce88eb11d0cde2fdd800a3fee23ef7ceb1536101befb79701959a2b
3
+ size 975211856
safetensors/model-00016-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ef756924c93bb25b0eff2c95fc2914442a771234cc80072c75164b8b08cd4b76
3
+ size 975211856
safetensors/model-00017-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5f1976810f7dae5f82b5c6767247b341a3fd074f8e1be30795ae377191fe3096
3
+ size 975211856
safetensors/model-00018-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f4576dc86e9d3e4854e2f5e1a2e3f15f87da7b532fed1f9569695bf57cc8019f
3
+ size 975211856
safetensors/model-00019-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a0203d156a4cc7d4e3e7aa19c40a0f384e13589d7df1b35469585a1b33dba7b8
3
+ size 975211856
safetensors/model-00020-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3091ea80e87285e3d22788bf2ced4da5ffd2cb79742754014ba3c96367b0c519
3
+ size 975211856
safetensors/model-00021-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:598a1515a0868d4c4cc0a4a9cfa52d7c403be78cbafde6bf2242495575626c30
3
+ size 975211856
safetensors/model-00022-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d2608f3d1c866789da80bf7b4c8d13a933d924cfed78c6aa093f5c4eff96eb74
3
+ size 975211856
safetensors/model-00023-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:cb5fdf1c908d8ed6b44d435dd4af192a2239e12e0aa1f590de0377df3f514b4f
3
+ size 975211856
safetensors/model-00024-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:55d1de446a5f8e3f3c07128712b59cc6cd6235a637c30fc47117e4d5ac5563ad
3
+ size 975211856
safetensors/model-00025-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4152feee81fd1f3596924b11710bc31d56c3c715ad7080e6b6b1c4578d18ac6d
3
+ size 975211856
safetensors/model-00026-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f697d79f6a0b88a260c2e80118f676d01282264eaf7bc2bd08ff4c666ebf775b
3
+ size 975211856
safetensors/model-00027-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4f9cc4673c82ff107bb95ea39ed9b5b0f065bfb64b1587e3c726b31a445f514c
3
+ size 975211856
safetensors/model-00028-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:605c3f95ec802233dbfdcea8c68144c783c8ddaa1b6925d6715c6b88c56e4bee
3
+ size 975211856
safetensors/model-00029-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a41a2fae62710e059349f53eca3a36c81a6e72f618b940d33f7667780846094a
3
+ size 975211856
safetensors/model-00030-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:9fe5cdfaabae63f288692e16ea02e9ece7244aebb653e44ab5283b2b638c1e86
3
+ size 975211856
safetensors/model-00031-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:85fb0221dce2bff2a9c525c1a1476d84ab5fed8ef20c138e6d4c1b11c0ad68d0
3
+ size 975211856
safetensors/model-00032-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:09f04580202c6d1c19450bb244a0e6df5e876ee9beb3565f3ddb9b36d44660c0
3
+ size 975211856
safetensors/model-00033-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:03e92e4222cb2f5f9c8840f706b2c57cdf02c6c5909093af082506dc3957169b
3
+ size 975211856
safetensors/model-00034-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:371959bf1b3984beabe8a4f2f98fa4efd4fa948c107ddbc7086c3e1fd40b920f
3
+ size 975211856
safetensors/model-00035-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2fe59baa42b90ecd7459b12f2db485cb3c5e6e4c7827b997bd0ec6cee3c03cfc
3
+ size 975211856
safetensors/model-00036-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b1453ac82c171c9b75559c75e6629535cf618d686a238130cb3df28dee54f0e5
3
+ size 975211856
safetensors/model-00037-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:56540af10fccb6f6eaba186395a1c0fcf291f7e53a1438bdd248282b8400327e
3
+ size 975211856
safetensors/model-00038-of-00064.safetensors ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fcaa84d3bc8d4130bf2e0728ea8fdbdf2fc05405248bc759bf80f13dedd6142c
3
+ size 975211856