view article Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control 4 days ago • 76
Fast3R: Towards 3D Reconstruction of 1000+ Images in One Forward Pass Paper • 2501.13928 • Published 15 days ago • 16
Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos Paper • 2501.13826 • Published 15 days ago • 22
MeshLRM: Large Reconstruction Model for High-Quality Mesh Paper • 2404.12385 • Published Apr 18, 2024 • 27