Vchitect/ShotVL-3B

Alexislhb committed on Jul 1

Commit a2503a4 · verified · 1 Parent(s): 0c1c4b5

Update README.md

Files changed (1)

README.md +2 -2

@@ -16,7 +16,7 @@ This repository contains **ShotVL-3B**, a fine-tuned version of [Qwen/Qwen2.5-VL
 *   **Paper:** [ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models](https://arxiv.org/abs/2506.21356)
 *   **Project Page:** [https://vchitect.github.io/ShotBench-project/](https://vchitect.github.io/ShotBench-project/)
-*   **Code:** [https://github.com/Alexios-hub/ShotBench](https://github.com/Alexios-hub/ShotBench)
+*   **Code:** [https://github.com/Vchitect/ShotBench](https://github.com/Vchitect/ShotBench)
 ## Abstract
@@ -26,7 +26,7 @@ Cinematography, the fundamental visual language of film, is essential for convey
 This model is a fine-tuned version of [Qwen/Qwen2.5-VL-3B-Instruct](https://huggingface.co/Qwen/Qwen2.5-VL-3B-Instruct), trained by supervised fine-tuning and GRPO on the largest and high-quality dataset for cinematic language understanding to date. It currently achieves state-of-the-art performance on [ShotBench](https://vchitect.github.io/ShotBench-project/), a comprehensive benchmark for evaluating cinematography understanding in vision-language models.
-### Demo Code
+### Demo
 **Image**
 ```python