File size: 1,494 Bytes
0f246e9 8064915 2c24f0f 0f246e9 2c24f0f 0f246e9 2c24f0f 0f246e9 8bcbb3a 0f246e9 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 |
---
license: apache-2.0
---
<p align="center">
<b><font size="6">Dispider</font></b>
<p>
<div align="center">
[💻Github Repo](https://github.com/Mark12Ding/Dispider)
[📖Paper](https://arxiv.org/abs/2501.03218)
</div>
## Quick Start
First download the checkpoints at the folder.
**Important**: Modify the ``mm_compressor`` path in config.json to align with your local environment. The checkpoint for ``mm_compressor`` is located within a sub-folder of this repository.
For detailed evaluation, please refer to [Github repo](https://github.com/Mark12Ding/Dispider).
## ✒️ Citation
If you find our work helpful for your research, please consider giving a star ⭐ and citation 📝.
```bibtex
@article{qian2025dispider,
title={Dispider: Enabling Video LLMs with Active Real-Time Interaction via Disentangled Perception, Decision, and Reaction},
author={Qian, Rui and Ding, Shuangrui and Dong, Xiaoyi and Zhang, Pan and Zang, Yuhang and Cao, Yuhang and Lin, Dahua and Wang, Jiaqi},
journal={arXiv preprint arXiv:2501.03218},
year={2025}
}
@article{qian2025streaming,
title={Streaming long video understanding with large language models},
author={Qian, Rui and Dong, Xiaoyi and Zhang, Pan and Zang, Yuhang and Ding, Shuangrui and Lin, Dahua and Wang, Jiaqi},
journal={Advances in Neural Information Processing Systems},
volume={37},
pages={119336--119360},
year={2025}
}
``` |