-
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers
Paper • 2408.06195 • Published • 70 -
Training Language Models to Self-Correct via Reinforcement Learning
Paper • 2409.12917 • Published • 136 -
Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters
Paper • 2408.03314 • Published • 54 -
Self-Reflection in LLM Agents: Effects on Problem-Solving Performance
Paper • 2405.06682 • Published • 3
Christophe Protat
Chris126
AI & ML interests
None yet
Recent Activity
updated
a model
2 days ago
Chris126/qwen-r1-aha-moment
published
a model
2 days ago
Chris126/qwen-r1-aha-moment
updated
a collection
5 months ago
Papers to read
Organizations
None yet