End of training

Files changed (3) hide show

README.md CHANGED Viewed

@@ -18,7 +18,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [TheBloke/zephyr-7B-beta-GPTQ](https://huggingface.co/TheBloke/zephyr-7B-beta-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3724
 ## Model description
@@ -44,18 +44,23 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 1
-- training_steps: 500
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.3837        | 0.0   | 100  | 0.4141          |
-| 0.3307        | 0.01  | 200  | 0.3936          |
-| 0.2929        | 0.01  | 300  | 0.3840          |
-| 0.2848        | 0.01  | 400  | 0.3761          |
-| 0.2959        | 0.02  | 500  | 0.3724          |
 ### Framework versions

 This model is a fine-tuned version of [TheBloke/zephyr-7B-beta-GPTQ](https://huggingface.co/TheBloke/zephyr-7B-beta-GPTQ) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2649
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 1
+- training_steps: 1000
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.4003        | 0.0   | 100  | 0.3292          |
+| 0.3077        | 0.01  | 200  | 0.3024          |
+| 0.2775        | 0.01  | 300  | 0.2978          |
+| 0.2873        | 0.01  | 400  | 0.2866          |
+| 0.2782        | 0.02  | 500  | 0.2805          |
+| 0.2859        | 0.02  | 600  | 0.2740          |
+| 0.2573        | 0.02  | 700  | 0.2709          |
+| 0.2753        | 0.03  | 800  | 0.2679          |
+| 0.265         | 0.03  | 900  | 0.2660          |
+| 0.263         | 0.03  | 1000 | 0.2649          |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4ae5485d1ad46b6ac6b30851d11bb60b0509fc7ac5731a203399c6e68d1ae58f
 size 54560368

 version https://git-lfs.github.com/spec/v1
+oid sha256:b0ebbf4a29dad20ad86c6d5ffc6f8505f38be203ca636141fb2db18d96233a68
 size 54560368

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:802d7ac76e4ec938e8fad6ed69df7ce4ed44f0162a9cb4ef6392381cdc35646b
 size 4728

 version https://git-lfs.github.com/spec/v1
+oid sha256:c8c7f01498084b64169cffbc93643166cb47c2b01ecfc2645cd30c6c54b02718
 size 4728