7

yang

fengfan933

AI & ML interests

None yet

Organizations

None yet

fengfan933's activity

upvoted a paper 3 days ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published 4 days ago • 89

upvoted a paper 11 days ago

Implicit Reasoning in Transformers is Reasoning through Shortcuts

Paper • 2503.07604 • Published 12 days ago • 20

upvoted 2 papers about 1 month ago

CoSER: Coordinating LLM-Based Persona Simulation of Established Roles

Paper • 2502.09082 • Published Feb 13 • 28

Distillation Scaling Laws

Paper • 2502.08606 • Published Feb 12 • 46

upvoted 3 papers about 2 months ago

Kimi k1.5: Scaling Reinforcement Learning with LLMs

Paper • 2501.12599 • Published Jan 22 • 105

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 354

Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training

Paper • 2501.11425 • Published Jan 20 • 94