AI & ML interests
None defined yet.
Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t"
-
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't
Paper • 2503.16219 • Published • 52 -
knoveleng/OpenRS-GRPO
Text Generation • 2B • Updated • 25 • 5 -
knoveleng/Open-RS1
Text Generation • 2B • Updated • 1.11k • 4 -
knoveleng/Open-RS2
Text Generation • 2B • Updated • 1.12k • 1
Model weights & datasets in the paper "Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn’t"
-
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't
Paper • 2503.16219 • Published • 52 -
knoveleng/OpenRS-GRPO
Text Generation • 2B • Updated • 25 • 5 -
knoveleng/Open-RS1
Text Generation • 2B • Updated • 1.11k • 4 -
knoveleng/Open-RS2
Text Generation • 2B • Updated • 1.12k • 1
datasets
7
knoveleng/redbench
Viewer
•
Updated
•
29.4k
•
246
knoveleng/open-rs
Viewer
•
Updated
•
7k
•
1.12k
•
11
knoveleng/open-deepscaler
Viewer
•
Updated
•
21k
•
34
•
4
knoveleng/open-s1
Viewer
•
Updated
•
18.6k
•
78
•
4
knoveleng/AMC-23
Viewer
•
Updated
•
40
•
6.35k
•
1
knoveleng/OlympiadBench
Viewer
•
Updated
•
675
•
4.94k
•
1
knoveleng/Minerva-Math
Viewer
•
Updated
•
272
•
5.05k
•
1