yolay committed on
Commit 20c61b3 · verified · 1 Parent(s): af65516

Update README.md

  1. README.md +16 -1
README.md CHANGED
@@ -43,4 +43,19 @@ The model DeepSeek-Qwen-7B is our optimized model for its advanced instruction-f
43
  | Ministral-8B-Instruct | Ours | 72.64 | 72.6 | 59.33 | 70.45 | 54.35 | <u>76.08</u> | 75.33 | 68.68 (+0.10%) |
44
  | DeepSeek-Qwen7B | I/O† | 60.81 | 72.39 | 57.99 | 66.86 | 59.59 | 62.80 | 79.64 | 65.73 |
45
  | DeepSeek-Qwen7B | SFT | 67.09 | 69.10 | 58.66 | 58.42 | 55.60 | 65.96 | 79.15 | 64.85 (-0.88%) |
46
- | DeepSeek-Qwen7B | Ours | 71.35 | 71.40 | 58.67 | 62.04 | 59.65 | 59.38 | 82.00 | 66.35 (+0.62%) |
46
+ | DeepSeek-Qwen7B | Ours | 71.35 | 71.40 | 58.67 | 62.04 | 59.65 | 59.38 | 82.00 | 66.35 (+0.62%) |
47
+
48
+
49
+
50
+ 🎓 If you find this work useful, please consider the following citation:
51
+ ```
52
+ @article{qin2025incentivizingreasoningadvancedinstructionfollowing,
53
+ title={Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models},
54
+ author={Yulei Qin and Gang Li and Zongyi Li and Zihan Xu and Yuchen Shi and Zhekai Lin and Xiao Cui and Ke Li and Xing Sun},
55
+ year={2025},
56
+ eprint={2506.01413},
57
+ archivePrefix={arXiv},
58
+ primaryClass={cs.CV},
59
+ url={https://arxiv.org/abs/2506.01413}
60
+ }
61
+ ```