RoboVIP: Multi-View Video Generation with Visual Identity Prompting Augments Robot Manipulation Paper • 2601.05241 • Published about 19 hours ago • 19
Harmony: Harmonizing Audio and Video Generation through Cross-Task Synergy Paper • 2511.21579 • Published Nov 26, 2025 • 23
The Smol Training Playbook 📚 • The secrets to building world-class LLMs • 2.82k
MesaTask: Towards Task-Driven Tabletop Scene Generation via 3D Spatial Reasoning Paper • 2509.22281 • Published Sep 26, 2025 • 32 • 3
AnySplat: Feed-forward 3D Gaussian Splatting from Unconstrained Views Paper • 2505.23716 • Published May 29, 2025 • 31
ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians Paper • 2406.16815 • Published Jun 24, 2024 • 7
Portrait3D: 3D Head Generation from Single In-the-wild Portrait Image Paper • 2406.16710 • Published Jun 24, 2024
Manipulation Collection Manipulation-related datasets and models • 15 items • Updated Sep 29, 2025 • 9