evilyesh
/

Qwen2.5-Coder-14B-Instruct-Thinking

Model card Files Files and versions

evilyesh commited on Feb 25

Commit

ed270c5

·

verified ·

1 Parent(s): 1e2e419

Upload README.md

Files changed (1) hide show

README.md +26 -1

README.md CHANGED Viewed

@@ -3,4 +3,29 @@ datasets:
 - GAIR/LIMO
 base_model:
 - Qwen/Qwen2.5-Coder-14B-Instruct
----

 - GAIR/LIMO
 base_model:
 - Qwen/Qwen2.5-Coder-14B-Instruct
+---
+# Fine-Tuned Model: Qwen/Qwen2.5-Coder-14B-Instruct on GAIR/LIMO
+## Overview
+This model is a fine-tuned version of the base model Qwen/Qwen2.5-Coder-14B-Instruct. It was trained on a subset of problems from the GAIR/LIMO dataset, specifically focusing on 611 problems over 2 training epochs.
+## Training Details
+- **Base Model**: Qwen/Qwen2.5-Coder-14B-Instruct
+- **Dataset**: GAIR/LIMO (subset of 611 problems)
+- **Epochs**: 2
+- **Training Limitations**: The training was constrained by the computational resources available on my machine, which means I haven't yet conducted a thorough evaluation of the model's performance improvements.
+## Key Observations
+During testing, the fine-tuned model demonstrated significant improvements in reasoning ability compared to the base model. It began to provide more coherent and accurate responses, avoiding the mistakes observed in the base model during my initial tests.
+## Next Steps
+While preliminary results are promising, further evaluation is needed to assess the overall improvement in model quality. I encourage the community to test the model and share their findings. Your feedback will be invaluable in understanding the extent of the improvements.
+## Acknowledgments
+Special thanks to the creators of the Qwen/Qwen2.5-Coder-14B-Instruct and GAIR/LIMO datasets for providing the foundational resources.