Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning Paper • 2506.06205 • Published Jun 6 • 29
BitVLA: 1-bit Vision-Language-Action Models for Robotics Manipulation Paper • 2506.07530 • Published Jun 9 • 20
Ark: An Open-source Python-based Framework for Robot Learning Paper • 2506.21628 • Published Jun 24 • 16
ThinkAct: Vision-Language-Action Reasoning via Reinforced Visual Latent Planning Paper • 2507.16815 • Published Jul 22 • 38
PRIX: Learning to Plan from Raw Pixels for End-to-End Autonomous Driving Paper • 2507.17596 • Published Jul 23 • 5
MolmoAct: Action Reasoning Models that can Reason in Space Paper • 2508.07917 • Published 23 days ago • 41