39 193 47

KABI

dongguanting

https://dongguanting.github.io/

AI & ML interests

Reasoning and Alignment for Large Language Models

Recent Activity

upvoted a paper 4 days ago

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

liked a model 6 days ago

dongguanting/Qwen3-8B-AEPO-DeepSearch

updated a model 6 days ago

dongguanting/Qwen3-8B-AEPO-DeepSearch

View all activity

Organizations

upvoted a paper 4 days ago

Seed-Prover 1.5: Mastering Undergraduate-Level Theorem Proving via Learning from Experience

Paper • 2512.17260 • Published 7 days ago • 47

liked a model 6 days ago

dongguanting/Qwen3-8B-AEPO-DeepSearch

Text Generation • 8B • Updated 6 days ago • 20 • 2

updated 2 models 6 days ago

dongguanting/Qwen3-8B-AEPO-DeepSearch

Text Generation • 8B • Updated 6 days ago • 20 • 2

dongguanting/QwQ-32B-AEPO-DeepSearch

Text Generation • 33B • Updated 6 days ago • 10

updated a collection 6 days ago

AEPO

Collection

The official datasets and model checkpoints of AEPO • 5 items • Updated 6 days ago • 4

updated a model 6 days ago

dongguanting/QwQ-32B-ARPO-DeepSearch

33B • Updated 6 days ago • 7

updated a collection 6 days ago

ARPO

Collection

The official datasets and model checkpoints of ARPO • 10 items • Updated 6 days ago • 6

upvoted a paper 10 days ago

Memory in the Age of AI Agents

Paper • 2512.13564 • Published 11 days ago • 112

published 2 models 10 days ago

dongguanting/QwQ-32B-ARPO-DeepSearch

33B • Updated 6 days ago • 7

dongguanting/QwQ-32B-AEPO-DeepSearch

Text Generation • 33B • Updated 6 days ago • 10

upvoted 2 papers 10 days ago

Thinking with Images via Self-Calling Agent

Paper • 2512.08511 • Published 17 days ago • 21

Long-horizon Reasoning Agent for Olympiad-Level Mathematical Problem Solving

Paper • 2512.10739 • Published 15 days ago • 45

upvoted a paper 23 days ago

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23 • 274

upvoted a paper 25 days ago

Latent Collaboration in Multi-Agent Systems

Paper • 2511.20639 • Published about 1 month ago • 116

upvoted 2 papers about 1 month ago

DR Tulu: Reinforcement Learning with Evolving Rubrics for Deep Research

Paper • 2511.19399 • Published Nov 24 • 60

General Agentic Memory Via Deep Research

Paper • 2511.18423 • Published Nov 23 • 161

upvoted 3 papers about 2 months ago

KABI

AI & ML interests

Recent Activity

Organizations

dongguanting's activity