yolay/RAIF-DeepSeek-Qwen-7B

Text Generation

text-generation-inference

yolay committed on Jun 3

Commit 20c61b3 · verified · 1 parent: af65516

Update README.md

Files changed (1)

README.md +16 -1

@@ -43,4 +43,19 @@ The model DeepSeek-Qwen-7B is our optimized model for its advanced instruction-f
 | Ministral-8B-Instruct  | Ours     | 72.64  | 72.6  | 59.33    | 70.45        | 54.35    | <u>76.08</u> | 75.33      | 68.68 (+0.10%)  |
 | DeepSeek-Qwen7B        | I/O†     | 60.81  | 72.39 | 57.99    | 66.86        | 59.59    | 62.80        | 79.64      | 65.73        |
 | DeepSeek-Qwen7B        | SFT      | 67.09  | 69.10 | 58.66    | 58.42        | 55.60    | 65.96        | 79.15      | 64.85 (-0.88%)  |
-| DeepSeek-Qwen7B        | Ours     | 71.35  | 71.40 | 58.67    | 62.04        | 59.65    | 59.38        | 82.00      | 66.35 (+0.62%) |

+| DeepSeek-Qwen7B        | Ours     | 71.35  | 71.40 | 58.67    | 62.04        | 59.65    | 59.38        | 82.00      | 66.35 (+0.62%) |
+🎓 If you find this work useful, please consider the following citation:
+```
+@article{qin2025incentivizingreasoningadvancedinstructionfollowing,
+      title={Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models},
+      author={Yulei Qin and Gang Li and Zongyi Li and Zihan Xu and Yuchen Shi and Zhekai Lin and Xiao Cui and Ke Li and Xing Sun},
+      year={2025},
+      eprint={2506.01413},
+      archivePrefix={arXiv},
+      primaryClass={cs.CV},
+      url={https://arxiv.org/abs/2506.01413}
+}
+```