Update README.md
Browse files
README.md
CHANGED
|
@@ -43,4 +43,19 @@ The model DeepSeek-Qwen-7B is our optimized model for its advanced instruction-f
|
|
| 43 |
| Ministral-8B-Instruct | Ours | 72.64 | 72.6 | 59.33 | 70.45 | 54.35 | <u>76.08</u> | 75.33 | 68.68 (+0.10%) |
|
| 44 |
| DeepSeek-Qwen7B | I/O† | 60.81 | 72.39 | 57.99 | 66.86 | 59.59 | 62.80 | 79.64 | 65.73 |
|
| 45 |
| DeepSeek-Qwen7B | SFT | 67.09 | 69.10 | 58.66 | 58.42 | 55.60 | 65.96 | 79.15 | 64.85 (-0.88%) |
|
| 46 |
-
| DeepSeek-Qwen7B | Ours | 71.35 | 71.40 | 58.67 | 62.04 | 59.65 | 59.38 | 82.00 | 66.35 (+0.62%) |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 43 |
| Ministral-8B-Instruct | Ours | 72.64 | 72.6 | 59.33 | 70.45 | 54.35 | <u>76.08</u> | 75.33 | 68.68 (+0.10%) |
|
| 44 |
| DeepSeek-Qwen7B | I/O† | 60.81 | 72.39 | 57.99 | 66.86 | 59.59 | 62.80 | 79.64 | 65.73 |
|
| 45 |
| DeepSeek-Qwen7B | SFT | 67.09 | 69.10 | 58.66 | 58.42 | 55.60 | 65.96 | 79.15 | 64.85 (-0.88%) |
|
| 46 |
+
| DeepSeek-Qwen7B | Ours | 71.35 | 71.40 | 58.67 | 62.04 | 59.65 | 59.38 | 82.00 | 66.35 (+0.62%) |
|
| 47 |
+
|
| 48 |
+
|
| 49 |
+
|
| 50 |
+
🎓 If you find this work useful, please consider the following citation:
|
| 51 |
+
```
|
| 52 |
+
@article{qin2025incentivizingreasoningadvancedinstructionfollowing,
|
| 53 |
+
title={Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models},
|
| 54 |
+
author={Yulei Qin and Gang Li and Zongyi Li and Zihan Xu and Yuchen Shi and Zhekai Lin and Xiao Cui and Ke Li and Xing Sun},
|
| 55 |
+
year={2025},
|
| 56 |
+
eprint={2506.01413},
|
| 57 |
+
archivePrefix={arXiv},
|
| 58 |
+
primaryClass={cs.CV},
|
| 59 |
+
url={https://arxiv.org/abs/2506.01413}
|
| 60 |
+
}
|
| 61 |
+
```
|