Alexislhb commited on
Commit
a2503a4
·
verified ·
1 Parent(s): 0c1c4b5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +2 -2
README.md CHANGED
@@ -16,7 +16,7 @@ This repository contains **ShotVL-3B**, a fine-tuned version of [Qwen/Qwen2.5-VL
16
 
17
  * **Paper:** [ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models](https://arxiv.org/abs/2506.21356)
18
  * **Project Page:** [https://vchitect.github.io/ShotBench-project/](https://vchitect.github.io/ShotBench-project/)
19
- * **Code:** [https://github.com/Alexios-hub/ShotBench](https://github.com/Alexios-hub/ShotBench)
20
 
21
  ## Abstract
22
 
@@ -26,7 +26,7 @@ Cinematography, the fundamental visual language of film, is essential for convey
26
 
27
  This model is a fine-tuned version of [Qwen/Qwen2.5-VL-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct), trained by supervised fine-tuning and GRPO on the largest and high-quality dataset for cinematic language understanding to date. It currently achieves state-of-the-art performance on [ShotBench](https://vchitect.github.io/ShotBench-project/), a comprehensive benchmark for evaluating cinematography understanding in vision-language models.
28
 
29
- ### Demo Code
30
 
31
  **Image**
32
  ```python
 
16
 
17
  * **Paper:** [ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models](https://arxiv.org/abs/2506.21356)
18
  * **Project Page:** [https://vchitect.github.io/ShotBench-project/](https://vchitect.github.io/ShotBench-project/)
19
+ * **Code:** [https://github.com/Vchitect/ShotBench](https://github.com/Vchitect/ShotBench)
20
 
21
  ## Abstract
22
 
 
26
 
27
  This model is a fine-tuned version of [Qwen/Qwen2.5-VL-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct), trained by supervised fine-tuning and GRPO on the largest and high-quality dataset for cinematic language understanding to date. It currently achieves state-of-the-art performance on [ShotBench](https://vchitect.github.io/ShotBench-project/), a comprehensive benchmark for evaluating cinematography understanding in vision-language models.
28
 
29
+ ### Demo
30
 
31
  **Image**
32
  ```python