
Inst-IT/LLaVA-Next-Inst-It-Qwen2-7B
Video-Text-to-Text
A series of large multimodal models (LMMs) fine-tuned on the Inst-IT Dataset, skilled in fine-grained, instance-level image and video understanding.