Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B Paper • 2511.06221 • Published Nov 9 • 131
Running on CPU Upgrade Featured 2.75k The Smol Training Playbook 📚 2.75k The secrets to building world-class LLMs
Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning Paper • 2507.00432 • Published Jul 1 • 79
deepseek-ai/DeepSeek-R1-0528-Qwen3-8B Text Generation • 8B • Updated May 29 • 473k • • 1.01k