DeepDream2045/73aef344-c2dd-4945-af6e-62519d2fccf8

Generated from Trainer

DeepDream2045 committed on Dec 16, 2024

Commit cf21f7c · verified · 1 Parent(s): 7266185

End of training

Files changed (2)

README.md +3 -3
adapter_model.bin +1 -1

README.md CHANGED

@@ -102,7 +102,7 @@ xformers_attention: true
 This model is a fine-tuned version of [unsloth/Qwen2.5-Coder-1.5B-Instruct](https://huggingface.co/unsloth/Qwen2.5-Coder-1.5B-Instruct) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0069
+- Loss: 1.0076
 ## Model description
@@ -140,8 +140,8 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 0.4995        | 0.0006 | 1    | 1.4900          |
-| 1.9502        | 0.0139 | 25   | 1.0298          |
-| 1.7977        | 0.0278 | 50   | 1.0069          |
+| 1.9567        | 0.0139 | 25   | 1.0318          |
+| 1.7992        | 0.0278 | 50   | 1.0076          |
 ### Framework versions

adapter_model.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:453ddd5fd4b7adadff3b889c7fb0116cfe534e5a63cb3f1a7ff23d9fa5eae03d
+oid sha256:ab9d541aa9d377b2156e4c853ba88569ba88417f196aa044616d54fbad042a56
 size 147859242
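
The adapter_model.bin change only swaps the Git LFS pointer: the `oid` changes while `size` stays at 147859242 bytes. As a minimal sketch, a downloaded copy of the file could be checked against the new pointer as below. The helper names `parse_lfs_pointer` and `verify_against_pointer` are hypothetical, and the local path in the usage note is an assumption, not something this commit specifies.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid line has the form "<hash-algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def verify_against_pointer(path, pointer):
    """Return True if the file at `path` matches the pointer's size and digest."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as f:
        # Read in 1 MiB chunks so a large file is not loaded into memory at once.
        for chunk in iter(lambda: f.read(1 << 20), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents copied from the "+" side of the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:ab9d541aa9d377b2156e4c853ba88569ba88417f196aa044616d54fbad042a56
size 147859242
"""

pointer = parse_lfs_pointer(NEW_POINTER)
```

Assuming the file was saved locally as `adapter_model.bin`, calling `verify_against_pointer("adapter_model.bin", pointer)` would return `True` only when both the byte count and the SHA-256 digest match the committed pointer.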