Yuting6 commited on
Commit
138210d
·
verified ·
1 Parent(s): 948c3b9

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +30 -0
README.md ADDED
@@ -0,0 +1,30 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: mit
3
+ datasets:
4
+ - Yuting6/geoqa-r1v-augmentation
5
+ - Yuting6/math-8k-augmentation
6
+ - Yuting6/m3cot-augmentation
7
+ - Yuting6/TQA-augmentation
8
+ - Yuting6/Geo3k-augmentation
9
+ - Yuting6/geoqa-r1v-noise
10
+ - Yuting6/geoqa-r1v-crop
11
+ - Yuting6/geoqa-r1v-blur
12
+ - Yuting6/geoqa-r1v-8k-rotated
13
+ - Yuting6/geoqa-r1v-8k-mixup
14
+ base_model:
15
+ - Qwen/Qwen2.5-VL-7B-Instruct
16
+ ---
17
+
18
+
19
+ # Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning
20
+
21
+ ## Paper Title and Link
22
+
23
+ The model was presented in the paper [Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning](https://arxiv.org/abs/2506.09736). You can also find the paper on arXiv: [Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math Reasoning (arXiv:2506.09736)](https://arxiv.org/abs/2506.09736)
24
+
25
+ ## Paper Abstract
26
+
27
+ Vision-Matters is a simple visual perturbation framework that can be easily integrated into existing post-training pipelines including SFT, DPO, and GRPO. Our findings highlight the critical role of visual perturbation: better reasoning begins with better seeing.
28
+
29
+ * 🐙 **GitHub Repo:** [YutingLi0606/Vision-Matters](https://github.com/YutingLi0606/Vision-Matters)
30
+ * 💾 **Dataset:** [Yuting6/vision-matters on Hugging Face](https://huggingface.co/collections/Yuting6/vision-matters-684801dd1879d3e639a930d1)