Update README.md
Browse files
README.md
CHANGED
@@ -7,7 +7,6 @@ pipeline_tag: video-text-to-text
|
|
7 |
|
8 |
This repo contains model checkpoints for **VISTA-LongVA**. [VISTA](https://huggingface.co/papers/2412.00927) is a video spatiotemporal augmentation method that generates long-duration and high-resolution video instruction-following data to enhance the video understanding capabilities of video LMMs.
|
9 |
|
10 |
-
### This repo is under construction. Please stay tuned.
|
11 |
[**π Homepage**](https://tiger-ai-lab.github.io/VISTA/) | [**π arXiv**](https://arxiv.org/abs/2412.00927) | [**π» GitHub**](https://github.com/TIGER-AI-Lab/VISTA) | [**π€ VISTA-400K**](https://huggingface.co/datasets/TIGER-Lab/VISTA-400K) | [**π€ Models**](https://huggingface.co/collections/TIGER-Lab/vista-674a2f0fab81be728a673193) | [**π€ HRVideoBench**](https://huggingface.co/datasets/TIGER-Lab/HRVideoBench)
|
12 |
|
13 |
## Video Instruction Data Synthesis Pipeline
|
|
|
7 |
|
8 |
This repo contains model checkpoints for **VISTA-LongVA**. [VISTA](https://huggingface.co/papers/2412.00927) is a video spatiotemporal augmentation method that generates long-duration and high-resolution video instruction-following data to enhance the video understanding capabilities of video LMMs.
|
9 |
|
|
|
10 |
[**π Homepage**](https://tiger-ai-lab.github.io/VISTA/) | [**π arXiv**](https://arxiv.org/abs/2412.00927) | [**π» GitHub**](https://github.com/TIGER-AI-Lab/VISTA) | [**π€ VISTA-400K**](https://huggingface.co/datasets/TIGER-Lab/VISTA-400K) | [**π€ Models**](https://huggingface.co/collections/TIGER-Lab/vista-674a2f0fab81be728a673193) | [**π€ HRVideoBench**](https://huggingface.co/datasets/TIGER-Lab/HRVideoBench)
|
11 |
|
12 |
## Video Instruction Data Synthesis Pipeline
|