Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published 3 days ago • 87
Intuitive physics understanding emerges from self-supervised pretraining on natural videos Paper • 2502.11831 • Published 24 days ago • 18
Learning Getting-Up Policies for Real-World Humanoid Robots Paper • 2502.12152 • Published 24 days ago • 37
Learning to Move Like Professional Counter-Strike Players Paper • 2408.13934 • Published Aug 25, 2024 • 23
High-Dimension Human Value Representation in Large Language Models Paper • 2404.07900 • Published Apr 11, 2024 • 1
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing Paper • 2306.11029 • Published Jun 19, 2023 • 1