Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models Paper • 2501.09686 • Published Jan 16 • 37
A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions Paper • 2412.08864 • Published Dec 12, 2024 • 1