Update README.md
Browse files
README.md
CHANGED
|
@@ -39,15 +39,14 @@ In the table below, we bold values in each column that are within 2 standard err
|
|
| 39 |
|
| 40 |
| Model | Data | AIME24 | AIME25 | AMC23 | MATH500 | HMMT O2/25 | LCB 06/24-01/25 | CodeElo | CodeForces | GPQA-D | JEEBench |
|
| 41 |
| ----------------------------------------------------------------------------------------------- | ----- | ------ | ------ | ------ | ------- | ---------- | --------------- | ------- | ---------- | ------ | -------- |
|
| 42 |
-
| [OpenThinker-7B](https://huggingface.co/open-thoughts/OpenThinker-7B) | β
|
|
| 43 |
-
| [OpenThinker2-7B](https://huggingface.co/open-thoughts/OpenThinker2-7B) | β
|
|
| 44 |
| **[OpenThinker3-7B](https://huggingface.co/open-thoughts/OpenThinker3-7B)** | β
|**69.0**|**53.3**|**93.5**| **90.0**| **42.7** | **51.7** | 31.0 |**32.2** | 53.7 |**72.4** |
|
| 45 |
| [DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) | β | 51.3 | 38.0 | 92.0 | 88.0 | 25.0 | 34.5 | 19.9 | 21.1 | 33.2 | 50.4 |
|
| 46 |
| [OpenR1-Distill-7B](https://huggingface.co/open-r1/OpenR1-Distill-7B) | β
| 57.7 | 39.7 | 87.0 | 88.0 | 25.7 | 30.7 | 30.1 | 29.3 |**58.9**| 68.7 |
|
| 47 |
| [Llama-3.1-Nemotron-Nano-8B-v1](https://huggingface.co/nvidia/Llama-3.1-Nemotron-Nano-8B-v1) | β
| 62.0 | 48.0 |**94.0**| 89.4 | 26.7 | **50.9** | 30.9 |**32.9** | 52.9 | 70.7 |
|
| 48 |
| [AceReason-Nemotron-7B](https://huggingface.co/nvidia/AceReason-Nemotron-7B) | β
|**71.0**| 50.7 |**93.8**| 89.8 | 33.3 | 44.3 |**32.9** |**30.9** | 52.9 | 64.3 |
|
| 49 |
|
| 50 |
-
|
| 51 |
# Data
|
| 52 |
|
| 53 |
This model was trained on the [OpenThoughts3-1.2M](https://huggingface.co/datasets/open-thoughts/OpenThoughts3-1.2M) dataset.
|
|
|
|
| 39 |
|
| 40 |
| Model | Data | AIME24 | AIME25 | AMC23 | MATH500 | HMMT O2/25 | LCB 06/24-01/25 | CodeElo | CodeForces | GPQA-D | JEEBench |
|
| 41 |
| ----------------------------------------------------------------------------------------------- | ----- | ------ | ------ | ------ | ------- | ---------- | --------------- | ------- | ---------- | ------ | -------- |
|
| 42 |
+
| [OpenThinker-7B](https://huggingface.co/open-thoughts/OpenThinker-7B) | β
| 30.7 | 22.0 | 72.5 | 82.8 | 15.7 | 26.1 | 11.1 | 14.9 | 38.6 | 45.3 |
|
| 43 |
+
| [OpenThinker2-7B](https://huggingface.co/open-thoughts/OpenThinker2-7B) | β
| 60.7 | 38.7 | 89.8 | 87.6 | 24.7 | 40.6 | 22.8 | 26.6 | 47.0 | 65.1 |
|
| 44 |
| **[OpenThinker3-7B](https://huggingface.co/open-thoughts/OpenThinker3-7B)** | β
|**69.0**|**53.3**|**93.5**| **90.0**| **42.7** | **51.7** | 31.0 |**32.2** | 53.7 |**72.4** |
|
| 45 |
| [DeepSeek-R1-Distill-Qwen-32B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-32B) | β | 51.3 | 38.0 | 92.0 | 88.0 | 25.0 | 34.5 | 19.9 | 21.1 | 33.2 | 50.4 |
|
| 46 |
| [OpenR1-Distill-7B](https://huggingface.co/open-r1/OpenR1-Distill-7B) | β
| 57.7 | 39.7 | 87.0 | 88.0 | 25.7 | 30.7 | 30.1 | 29.3 |**58.9**| 68.7 |
|
| 47 |
| [Llama-3.1-Nemotron-Nano-8B-v1](https://huggingface.co/nvidia/Llama-3.1-Nemotron-Nano-8B-v1) | β
| 62.0 | 48.0 |**94.0**| 89.4 | 26.7 | **50.9** | 30.9 |**32.9** | 52.9 | 70.7 |
|
| 48 |
| [AceReason-Nemotron-7B](https://huggingface.co/nvidia/AceReason-Nemotron-7B) | β
|**71.0**| 50.7 |**93.8**| 89.8 | 33.3 | 44.3 |**32.9** |**30.9** | 52.9 | 64.3 |
|
| 49 |
|
|
|
|
| 50 |
# Data
|
| 51 |
|
| 52 |
This model was trained on the [OpenThoughts3-1.2M](https://huggingface.co/datasets/open-thoughts/OpenThoughts3-1.2M) dataset.
|