VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use Paper • 2509.01055 • Published 3 days ago • 37
UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning Paper • 2509.02544 • Published about 22 hours ago • 53
SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning Paper • 2509.02479 • Published about 23 hours ago • 55
The Landscape of Agentic Reinforcement Learning for LLMs: A Survey Paper • 2509.02547 • Published about 22 hours ago • 57
How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench Paper • 2508.20931 • Published 6 days ago • 14
PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning Paper • 2508.21104 • Published 6 days ago • 26
Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models Paper • 2508.21365 • Published 5 days ago • 19
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers Paper • 2508.21148 • Published 6 days ago • 117
WebGen-Bench Collection Datasets and models introduced in the paper "WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch". • 11 items • Updated 5 days ago • 1
WebGen-Bench Collection Datasets and models introduced in the paper "WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch". • 11 items • Updated 5 days ago • 1