VideoMind Collection VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning • 8 items • Updated 3 days ago • 3
Edit Transfer: Learning Image Editing via Vision In-Context Relations Paper • 2503.13327 • Published 17 days ago • 28
Long-Context Autoregressive Video Modeling with Next-Frame Prediction Paper • 2503.19325 • Published 10 days ago • 70
VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Paper • 2503.13444 • Published 17 days ago • 15
VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Paper • 2503.13444 • Published 17 days ago • 15
VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning Paper • 2503.13444 • Published 17 days ago • 15 • 2
VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary Paper • 2503.09402 • Published 22 days ago • 6
VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary Paper • 2503.09402 • Published 22 days ago • 6 • 2
Automated Movie Generation via Multi-Agent CoT Planning Paper • 2503.07314 • Published 24 days ago • 42