unsloth/DeepSeek-R1-Distill-Llama-8B-unsloth-bnb-4bit • Text Generation • Updated 9 days ago • 21k • 10
Tarsier2: Advancing Large Vision-Language Models from Detailed Video Description to Comprehensive Video Understanding • Paper • 2501.07888 • Published 17 days ago • 15