Update README.md
Browse files
README.md
CHANGED
@@ -50,6 +50,9 @@ The code and data for reproducing our results can be found in our [github repo](
|
|
50 |
|
51 |
| **Models** | **AIME24 avg@32** | **AIME25 avg@32** | **Minerva Math avg@4** | **Olympiad Bench avg@4** | **AMC23 avg@8** |
|
52 |
| --- | --- | --- | --- | --- | --- |
|
|
|
|
|
|
|
53 |
| Deepseek-R1-Distill-Qwen-7B | 55.0 | 39.7 | 36.7 | 56.8 | 81.9 |
|
54 |
| AReal-boba-RL-7B | 61.9 | 48.3 | 39.5 | 61.9 | 86.4 |
|
55 |
| Skywork-OR1-7B-Math | 69.8 | 52.3 | **40.8** | 63.2 | 85.3 |
|
|
|
50 |
|
51 |
| **Models** | **AIME24 avg@32** | **AIME25 avg@32** | **Minerva Math avg@4** | **Olympiad Bench avg@4** | **AMC23 avg@8** |
|
52 |
| --- | --- | --- | --- | --- | --- |
|
53 |
+
| DeepScaleR-1.5B | 43.1 | 27.2 | 34.6 | 40.7 | 50.6 |
|
54 |
+
| Qwen3-1.7B | 48.3 | 36.8 | 34.9 | 55.1 | 75.6 |
|
55 |
+
| **`POLARIS-1.7B-Preview`** | **66.9** | **53.0** | **38.9** | **63.8** | **85.8** |
|
56 |
| Deepseek-R1-Distill-Qwen-7B | 55.0 | 39.7 | 36.7 | 56.8 | 81.9 |
|
57 |
| AReal-boba-RL-7B | 61.9 | 48.3 | 39.5 | 61.9 | 86.4 |
|
58 |
| Skywork-OR1-7B-Math | 69.8 | 52.3 | **40.8** | 63.2 | 85.3 |
|