qingyangzhang/Qwen2.5-3B-GRPO-Natural-Reasoning-stage-2 Text Generation • 3B • Updated May 1 • 4
qingyangzhang/Qwen2.5-3B-EMPO-Natural-Reasoning-from-base Text Generation • 3B • Updated Apr 20 • 6
qingyangzhang/Qwen2.5-3B-EMPO-Natural-Reasoning-STEM-20K-free-form Text Generation • 3B • Updated Apr 18 • 4
qingyangzhang/Qwen2.5-3B-EMPO-Natural-Reasoning-STEM-20K-short-answer Text Generation • 3B • Updated Apr 16 • 6