How far can we go with ImageNet for Text-to-Image generation? Paper • 2502.21318 • Published 11 days ago • 25
Around the World in 80 Timesteps: A Generative Approach to Global Visual Geolocation Paper • 2412.06781 • Published Dec 9, 2024 • 21
Short Film Dataset (SFD): A Benchmark for Story-Level Video Understanding Paper • 2406.10221 • Published Jun 14, 2024