LLaVA-Video - a lmms-lab Collection

lmms-lab 's Collections

EgoLife

LLaVA-OneVision

LongVA

LLaVA-Next-Interleave

LLaVA-Video

updated Feb 21

Models focus on video understanding (previously known as LLaVA-NeXT-Video).

Video Instruction Tuning With Synthetic Data

Paper • 2410.02713 • Published Oct 3, 2024 • 38
lmms-lab/LLaVA-Video-178K

Viewer • Updated Oct 11, 2024 • 1.63M • 19.5k • 125
lmms-lab/LLaVA-Video-7B-Qwen2

Video-Text-to-Text • Updated Oct 25, 2024 • 46.1k • 86
lmms-lab/LLaVA-Video-72B-Qwen2

Text Generation • Updated Oct 25, 2024 • 3.06k • 18
lmms-lab/LLaVA-Video-7B-Qwen2-Video-Only

Text Generation • Updated Oct 4, 2024 • 1.51k • 4
lmms-lab/LLaVA-NeXT-Video-7B-DPO

Video-Text-to-Text • Updated Feb 21 • 2.53k • 25
lmms-lab/LLaVA-NeXT-Video-7B

Video-Text-to-Text • Updated Feb 21 • 1.24k • 45