ericrisco/llama2-instruct-tune-500s


Files changed (4)

README.md +23 -23
adapter_model.safetensors +1 -1
runs/Sep26_13-42-54_3d63c6bcbcfd/events.out.tfevents.1727358175.3d63c6bcbcfd.3916.1 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -19,7 +19,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [NousResearch/Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) on the generator dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.6761
+- Loss: 1.6759
 ## Model description
@@ -51,30 +51,30 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
 | 1.9468        | 0.0027 | 20   | 1.8141          |
-| 1.8737        | 0.0054 | 40   | 1.7848          |
+| 1.8734        | 0.0054 | 40   | 1.7849          |
 | 1.8769        | 0.0081 | 60   | 1.7718          |
-| 1.8634        | 0.0108 | 80   | 1.7598          |
-| 1.8584        | 0.0135 | 100  | 1.7469          |
-| 1.8271        | 0.0163 | 120  | 1.7170          |
-| 1.8706        | 0.0190 | 140  | 1.7042          |
-| 1.8306        | 0.0217 | 160  | 1.7005          |
-| 1.7954        | 0.0244 | 180  | 1.6948          |
-| 1.8616        | 0.0271 | 200  | 1.6947          |
+| 1.8633        | 0.0108 | 80   | 1.7599          |
+| 1.8583        | 0.0135 | 100  | 1.7472          |
+| 1.8264        | 0.0163 | 120  | 1.7176          |
+| 1.8714        | 0.0190 | 140  | 1.7053          |
+| 1.831         | 0.0217 | 160  | 1.7012          |
+| 1.7957        | 0.0244 | 180  | 1.6947          |
+| 1.8613        | 0.0271 | 200  | 1.6934          |
 | 1.81          | 0.0298 | 220  | 1.6915          |
-| 1.8003        | 0.0325 | 240  | 1.6900          |
-| 1.9069        | 0.0352 | 260  | 1.6880          |
-| 1.8266        | 0.0379 | 280  | 1.6868          |
-| 1.8615        | 0.0406 | 300  | 1.6849          |
-| 1.7728        | 0.0433 | 320  | 1.6832          |
-| 1.806         | 0.0461 | 340  | 1.6824          |
-| 1.8843        | 0.0488 | 360  | 1.6812          |
-| 1.7655        | 0.0515 | 380  | 1.6803          |
-| 1.812         | 0.0542 | 400  | 1.6795          |
-| 1.8058        | 0.0569 | 420  | 1.6779          |
-| 1.7424        | 0.0596 | 440  | 1.6779          |
-| 1.8976        | 0.0623 | 460  | 1.6782          |
-| 1.8237        | 0.0650 | 480  | 1.6778          |
-| 1.8981        | 0.0677 | 500  | 1.6761          |
+| 1.7995        | 0.0325 | 240  | 1.6893          |
+| 1.9067        | 0.0352 | 260  | 1.6872          |
+| 1.8261        | 0.0379 | 280  | 1.6860          |
+| 1.8609        | 0.0406 | 300  | 1.6843          |
+| 1.7725        | 0.0433 | 320  | 1.6835          |
+| 1.8061        | 0.0461 | 340  | 1.6819          |
+| 1.8842        | 0.0488 | 360  | 1.6804          |
+| 1.7648        | 0.0515 | 380  | 1.6799          |
+| 1.8121        | 0.0542 | 400  | 1.6796          |
+| 1.8056        | 0.0569 | 420  | 1.6777          |
+| 1.7423        | 0.0596 | 440  | 1.6780          |
+| 1.8971        | 0.0623 | 460  | 1.6782          |
+| 1.8234        | 0.0650 | 480  | 1.6771          |
+| 1.8978        | 0.0677 | 500  | 1.6759          |
 ### Framework versions

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c43dc04532b014d1c7de792f57ae7b7d5a3f8e6bd497a9677993c63d5ee08096
+oid sha256:311ca013ee3478a8b5594c05f6312e32c1a788788fbfac207b6914e873bea033
 size 134235048

runs/Sep26_13-42-54_3d63c6bcbcfd/events.out.tfevents.1727358175.3d63c6bcbcfd.3916.1 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:2f8aa56e0ed8aeb8b9fe65753f25dc6673bdbfef547196ed138c9db603ad9277
+size 23452

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0d7d05ef4184ef5384d4cbcc938ea6aefb352a1c32c3276f279e3f32f1fe21f7
+oid sha256:837ee62862faf7e261a706baf6d07f66d3d7cadc6beb5e43fc4a5fc76ec3d4c4
 size 5496
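
The three binary files in this commit appear only as Git LFS pointer files: a `version` line, an `oid` line carrying the hash algorithm and digest, and a `size` line in bytes. As a minimal sketch of how such a pointer can be read, the snippet below parses the pointer added for the TensorBoard events file. The helper name `parse_lfs_pointer` is hypothetical and not part of this repository or of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse the key/value lines of a Git LFS pointer file (hypothetical helper)."""
    fields = {}
    for line in text.strip().splitlines():
        # Each pointer line is "<key> <value>", split on the first space.
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid value has the form "<hash algorithm>:<hex digest>".
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),  # size of the real file in bytes
    }


# Pointer content copied from the events file hunk in this commit.
pointer = parse_lfs_pointer(
    """version https://git-lfs.github.com/spec/v1
oid sha256:2f8aa56e0ed8aeb8b9fe65753f25dc6673bdbfef547196ed138c9db603ad9277
size 23452
"""
)
print(pointer["hash_algo"], pointer["size"])
```

For `adapter_model.safetensors` and `training_args.bin`, the `size` field is unchanged across the commit while the `oid` differs, which is consistent with same-sized files whose contents changed.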