- rStar2-Agent: Agentic Reasoning Technical Report
  Paper • 2508.20722 • Published • 88
- AgentFly: Fine-tuning LLM Agents without Fine-tuning LLMs
  Paper • 2508.16153 • Published • 127
- Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
  Paper • 2508.14029 • Published • 116
- InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
  Paper • 2508.18265 • Published • 175
peterlee6706
AI & ML interests
None yet
Recent Activity
updated a collection 4 days ago: WeekDaily
updated a collection 4 days ago: WeekDaily
updated a collection 4 days ago: WeekDaily
Organizations
None yet