Can a Single Model Master Both Multi-turn Conversations and Tool Use? CALM: A Unified Conversational Agentic Language Model Paper • 2502.08820 • Published 29 days ago • 4
ViLMA: A Zero-Shot Benchmark for Linguistic and Temporal Grounding in Video-Language Models Paper • 2311.07022 • Published Nov 13, 2023 • 1