Submitted by YannQi 86 R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning · 6 authors 45 2
Submitted by CoCoOne 80 A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers · 103 authors 166 2
Submitted by delinqu 60 EmbodiedOneVision: Interleaved Vision-Text-Action Pretraining for General Robot Control · 15 authors 142 3
Submitted by lixiaochuan 60 Droplet3D: Commonsense Priors from Videos Facilitate 3D Generation · 14 authors 8 2
Submitted by wanng 48 A.S.E: A Repository-Level Benchmark for Evaluating Security in AI-Generated Code · 21 authors 134 3
Submitted by Shunian 16 TalkVid: A Large-Scale Diversified Dataset for Audio-Driven Talking Head Synthesis · 13 authors 73 3
Submitted by taesiri 13 Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models · 8 authors 3
Submitted by XiaohuanZhou 10 TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training · 9 authors 2
Submitted by taesiri 8 UItron: Foundational GUI Agent with Advanced Perception and Planning · 10 authors 2
Submitted by JiaaqiLiu 2 Mimicking the Physicist's Eye:A VLM-centric Approach for Physics Formula Discovery · 15 authors 2
Submitted by gemcollector 1 HERMES: Human-to-Robot Embodied Learning from Multi-Source Motion Data for Mobile Dexterous Manipulation · 7 authors 2
Submitted by nennomp - Deep Residual Echo State Networks: exploring residual orthogonal connections in untrained Recurrent Neural Networks · 3 authors 0 2
Submitted by AllanK24 - Quantization Robustness to Input Degradations for Object Detection · 3 authors 2
Submitted by yhua219 - EduRABSA: An Education Review Dataset for Aspect-based Sentiment Analysis Tasks · 4 authors 3 2