Chancy commited on
Commit
dcab747
·
verified ·
1 Parent(s): 5fe2d6f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -0
README.md CHANGED
@@ -50,6 +50,9 @@ The code and data for reproducing our results can be found in our [github repo](
50
 
51
  | **Models** | **AIME24 avg@32** | **AIME25 avg@32** | **Minerva Math avg@4** | **Olympiad Bench avg@4** | **AMC23 avg@8** |
52
  | --- | --- | --- | --- | --- | --- |
 
 
 
53
  | Deepseek-R1-Distill-Qwen-7B | 55.0 | 39.7 | 36.7 | 56.8 | 81.9 |
54
  | AReal-boba-RL-7B | 61.9 | 48.3 | 39.5 | 61.9 | 86.4 |
55
  | Skywork-OR1-7B-Math | 69.8 | 52.3 | **40.8** | 63.2 | 85.3 |
 
50
 
51
  | **Models** | **AIME24 avg@32** | **AIME25 avg@32** | **Minerva Math avg@4** | **Olympiad Bench avg@4** | **AMC23 avg@8** |
52
  | --- | --- | --- | --- | --- | --- |
53
+ | DeepScaleR-1.5B | 43.1 | 27.2 | 34.6 | 40.7 | 50.6 |
54
+ | Qwen3-1.7B | 48.3 | 36.8 | 34.9 | 55.1 | 75.6 |
55
+ | **`POLARIS-1.7B-Preview`** | **66.9** | **53.0** | **38.9** | **63.8** | **85.8** |
56
  | Deepseek-R1-Distill-Qwen-7B | 55.0 | 39.7 | 36.7 | 56.8 | 81.9 |
57
  | AReal-boba-RL-7B | 61.9 | 48.3 | 39.5 | 61.9 | 86.4 |
58
  | Skywork-OR1-7B-Math | 69.8 | 52.3 | **40.8** | 63.2 | 85.3 |