AIMv2 Collection A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint. • 19 items • Updated Nov 22, 2024 • 74
Running on CPU Upgrade 671 671 Open ASR Leaderboard 🏆 Request evaluation of a speech recognition model
LanguageBind/LanguageBind_Video_Huge_V1.5_FT Zero-Shot Image Classification • Updated Feb 1, 2024 • 1.74k • 4
Running 105 105 Llmlingua 2 💻 Compress lengthy prompts into shorter versions while preserving key information
Llama 3.1 Collection This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 653
SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models Paper • 2407.15841 • Published Jul 22, 2024 • 40