Spaces:
Running
Running
Distilled R1 Responses Unstructured to Structured
#8 opened about 3 hours ago
by
bhaviktheslider
SmolLm2-135 R1 Distill
#5 opened about 13 hours ago
by
ewre324
What is the compute needed for GRPO for 7B R1-Distill model?
#4 opened 2 days ago
by
AndrewSanders
Reproducing Deepseek's numbers for MATH-500
#3 opened 2 days ago
by
edbeeching
Recommend a dataset in the scientific domain made by us: EricLu/SCP-116K
2
#2 opened 3 days ago
by
EricLu
LLM Benchmarks and Data Leakage
3
#1 opened 3 days ago
by
dvamvour