Zimu Lu's picture

Zimu Lu

luzimu

·

mnluzimu

AI & ML interests

None yet

Recent Activity

upvoted a paper about 1 hour ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

upvoted a paper about 1 hour ago

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

upvoted a paper about 1 hour ago

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

View all activity

Organizations

None yet

upvoted 6 papers about 1 hour ago

VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use

Paper • 2509.01055 • Published 3 days ago • 37

UI-TARS-2 Technical Report: Advancing GUI Agent with Multi-Turn Reinforcement Learning

Paper • 2509.02544 • Published about 22 hours ago • 53

SimpleTIR: End-to-End Reinforcement Learning for Multi-Turn Tool-Integrated Reasoning

Paper • 2509.02479 • Published about 23 hours ago • 55

The Landscape of Agentic Reinforcement Learning for LLMs: A Survey

Paper • 2509.02547 • Published about 22 hours ago • 57

How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on τ-bench

Paper • 2508.20931 • Published 6 days ago • 14

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Paper • 2508.21104 • Published 6 days ago • 26

upvoted 2 papers 2 days ago

Think in Games: Learning to Reason in Games via Reinforcement Learning with Large Language Models

Paper • 2508.21365 • Published 5 days ago • 19

A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers

Paper • 2508.21148 • Published 6 days ago • 117

updated a collection 5 days ago

WebGen-Bench

Datasets and models introduced in the paper "WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch". • 11 items • Updated 5 days ago • 1

upvoted a collection 5 days ago

WebGen-Bench

Datasets and models introduced in the paper "WebGen-Bench: Evaluating LLMs on Generating Interactive and Functional Websites from Scratch". • 11 items • Updated 5 days ago • 1