Update README.md
README.md
CHANGED
@@ -49,9 +49,6 @@ print(outputs[0].outputs[0].text)
 
 # 📃Evaluation
 
-LUFFY is evaluated on six competition-level benchmarks, achieving state-of-the-art results among all zero-RL methods. It surpasses both on-policy RL and imitation learning (SFT), especially in generalization:
-
-## LUFFY on Qwen2.5-Instruct-7B
 | **Model** | **AIME 2024** | **AIME 2025** | **AMC** | **MATH-500** | **Minerva** | **Olympiad** | **Avg.** |
 |-----------------------------------|-------------|-------------|---------|---------------|-------------|---------------|----------|
 | Qwen2.5-7B-Instruct | 11.9 | 7.6 | 44.1 | 74.6 | 30.5 | 39.7 | 34.7 |
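For reference, the **Avg.** column in the retained table appears to be the unweighted mean of the six benchmark scores; the snippet below is a minimal sketch, assuming simple averaging (the README does not state the weighting), that reproduces the Qwen2.5-7B-Instruct value.

```python
# Sketch: check that "Avg." matches the unweighted mean of the six benchmarks
# (assumption: simple averaging; the README does not specify the weighting).
scores = {
    "AIME 2024": 11.9,
    "AIME 2025": 7.6,
    "AMC": 44.1,
    "MATH-500": 74.6,
    "Minerva": 30.5,
    "Olympiad": 39.7,
}

avg = sum(scores.values()) / len(scores)
print(round(avg, 1))  # 34.7, matching the Qwen2.5-7B-Instruct row
```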