TIGER-Lab
/

VL-Rethinker-72B

Visual Question Answering

text-generation-inference

Model card Files Files and versions

JasperHaozhe commited on Apr 21

Commit

320acaf

·

verified ·

1 Parent(s): 35da444

Update README.md

Files changed (1) hide show

README.md +3 -1

README.md CHANGED Viewed

@@ -12,6 +12,8 @@ pipeline_tag: visual-question-answering
 # VL-Rethinker-72B
 **VL-Rethinker-72B** achieves SoTA results on various multimodal reasoning benchmarks.
 It is trained using the **Forced Rethinking** technique, on top of [**VL-Reasoner**](https://huggingface.co/TIGER-Lab/VL-Reasoner-72B/) with **GRPO-SSR** training.
@@ -27,7 +29,7 @@ Explore further via the following links:
 ## Citation
 If you feel this model useful, please give us a free cite:
-```
 @article{vl-rethinker,
       title={VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning},
       author = {Wang, Haozhe and Qu, Chao and Huang, Zuming and Chu, Wei and Lin, Fangzhen and Chen, Wenhu},

 # VL-Rethinker-72B
+**🚀 News:** <u>We release our meticulously curated collection of RL training queries for multimodal reasoning: [ViRL39K](https://huggingface.co/datasets/TIGER-Lab/ViRL39K).</u>
 **VL-Rethinker-72B** achieves SoTA results on various multimodal reasoning benchmarks.
 It is trained using the **Forced Rethinking** technique, on top of [**VL-Reasoner**](https://huggingface.co/TIGER-Lab/VL-Reasoner-72B/) with **GRPO-SSR** training.
 ## Citation
 If you feel this model useful, please give us a free cite:
+```bibtex
 @article{vl-rethinker,
       title={VL-Rethinker: Incentivizing Self-Reflection of Vision-Language Models with Reinforcement Learning},
       author = {Wang, Haozhe and Qu, Chao and Huang, Zuming and Chu, Wei and Lin, Fangzhen and Chen, Wenhu},