LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents Paper β’ 2311.05437 β’ Published Nov 9, 2023 β’ 48
Ziya-VL: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning Paper β’ 2310.08166 β’ Published Oct 12, 2023 β’ 1