CLEA: Closed-Loop Embodied Agent for Enhancing Task Execution in Dynamic Environments Paper • 2503.00729 • Published 16 days ago • 3
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published 15 days ago • 74
DexTrack: Towards Generalizable Neural Tracking Control for Dexterous Manipulation from Human References Paper • 2502.09614 • Published Feb 13 • 12
STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning Paper • 2502.10177 • Published Feb 14 • 6
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 8 items • Updated 22 days ago • 400
MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making Paper • 2409.16686 • Published Sep 25, 2024 • 10