ZeroI2V: Zero-Cost Adaptation of Pre-trained Transformers from Image to Video Paper • 2310.01324 • Published Oct 2, 2023 • 1
InternVideo2: Scaling Video Foundation Models for Multimodal Video Understanding Paper • 2403.15377 • Published Mar 22, 2024 • 25
VideoChat-Flash: Hierarchical Compression for Long-Context Video Modeling Paper • 2501.00574 • Published Dec 31, 2024 • 6