Update README.md
Browse files
README.md
CHANGED
@@ -9,7 +9,7 @@ base_model:
|
|
9 |
|
10 |
## Overview
|
11 |
|
12 |
-
This model is a fine-tuned version of the base model Qwen/Qwen2.5-Coder-14B-Instruct. It was trained on a subset of problems from the GAIR/LIMO dataset, specifically focusing on 611 problems over
|
13 |
|
14 |
After testing more I found that the model does not always include reasoning, I will update with more epochs.
|
15 |
|
@@ -20,7 +20,7 @@ Need to use phrase: "Structure your thoughts. Be attentive to details."
|
|
20 |
|
21 |
- **Base Model**: Qwen/Qwen2.5-Coder-14B-Instruct
|
22 |
- **Dataset**: GAIR/LIMO (subset of 611 problems)
|
23 |
-
- **Epochs**:
|
24 |
- **Training Limitations**: The training was constrained by the computational resources available on my machine, which means I haven't yet conducted a thorough evaluation of the model's performance improvements.
|
25 |
|
26 |
## Key Observations
|
|
|
9 |
|
10 |
## Overview
|
11 |
|
12 |
+
This model is a fine-tuned version of the base model Qwen/Qwen2.5-Coder-14B-Instruct. It was trained on a subset of problems from the GAIR/LIMO dataset, specifically focusing on 611 problems over 14 training epochs.
|
13 |
|
14 |
After testing more I found that the model does not always include reasoning, I will update with more epochs.
|
15 |
|
|
|
20 |
|
21 |
- **Base Model**: Qwen/Qwen2.5-Coder-14B-Instruct
|
22 |
- **Dataset**: GAIR/LIMO (subset of 611 problems)
|
23 |
+
- **Epochs**: 14
|
24 |
- **Training Limitations**: The training was constrained by the computational resources available on my machine, which means I haven't yet conducted a thorough evaluation of the model's performance improvements.
|
25 |
|
26 |
## Key Observations
|