The collection for the paper "Walk Before You Run! Concise LLM Reasoning via Reinforcement Learning"
Mingyang Song
Nickyang
AI & ML interests
LRMs, Long-Context LLMs, LLM Judges, Many-Shot ICL
Recent Activity
upvoted a paper about 8 hours ago
A Survey of On-Policy Distillation for Large Language Models
submitted a paper 1 day ago
A Survey of On-Policy Distillation for Large Language Models
upvoted a collection 3 months ago
HY-MT1.5
Organizations
None yet