Update README.md
Browse files
README.md
CHANGED
|
@@ -12,6 +12,7 @@ base_model:
|
|
| 12 |
This model is a fine-tuned version of the base model Qwen/Qwen2.5-Coder-14B-Instruct. It was trained on a subset of problems from the GAIR/LIMO dataset, specifically focusing on 611 problems over 4 training epochs.
|
| 13 |
|
| 14 |
After testing more I found that the model does not always include reasoning, I will update with more epochs.
|
|
|
|
| 15 |
|
| 16 |
## Training Details
|
| 17 |
|
|
|
|
| 12 |
This model is a fine-tuned version of the base model Qwen/Qwen2.5-Coder-14B-Instruct. It was trained on a subset of problems from the GAIR/LIMO dataset, specifically focusing on 611 problems over 4 training epochs.
|
| 13 |
|
| 14 |
After testing more I found that the model does not always include reasoning, I will update with more epochs.
|
| 15 |
+
Warning! The model often goes into an endless chain of reasoning.
|
| 16 |
|
| 17 |
## Training Details
|
| 18 |
|