strickvl
/

isafpr-tiny-llama-lora

Generated from Trainer

4-bit precision

Model card Files Files and versions Community

strickvl commited on Jun 13, 2024

Commit

d0713fa

·

verified ·

1 Parent(s): 14cca0c

End of training

Files changed (2) hide show

README.md +16 -18
adapter_model.bin +1 -1

README.md CHANGED Viewed

@@ -96,7 +96,7 @@ special_tokens:
 This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T](https://huggingface.co/TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0212
 ## Model description
@@ -133,23 +133,21 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| 0.8068        | 0.0227 | 1    | 0.8529          |
-| 0.4759        | 0.25   | 11   | 0.4152          |
-| 0.0851        | 0.5    | 22   | 0.0833          |
-| 0.0385        | 0.75   | 33   | 0.0434          |
-| 0.0321        | 1.0    | 44   | 0.0365          |
-| 0.0326        | 1.1705 | 55   | 0.0315          |
-| 0.1114        | 1.4205 | 66   | 0.0283          |
-| 0.0275        | 1.6705 | 77   | 0.0261          |
-| 0.0282        | 1.9205 | 88   | 0.0246          |
-| 0.0206        | 2.0909 | 99   | 0.0237          |
-| 0.0675        | 2.3409 | 110  | 0.0228          |
-| 0.0201        | 2.5909 | 121  | 0.0222          |
-| 0.0176        | 2.8409 | 132  | 0.0218          |
-| 0.0941        | 3.0114 | 143  | 0.0214          |
-| 0.0262        | 3.2614 | 154  | 0.0213          |
-| 0.051         | 3.5114 | 165  | 0.0213          |
-| 0.0184        | 3.7614 | 176  | 0.0212          |
 ### Framework versions

 This model is a fine-tuned version of [TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T](https://huggingface.co/TinyLlama/TinyLlama-1.1B-intermediate-step-1431k-3T) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0557
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| 1.7724        | 0.0303 | 1    | 1.7779          |
+| 1.2158        | 0.2727 | 9    | 1.0692          |
+| 0.2116        | 0.5455 | 18   | 0.1796          |
+| 0.1051        | 0.8182 | 27   | 0.1048          |
+| 0.0762        | 1.0227 | 36   | 0.0859          |
+| 0.0704        | 1.2955 | 45   | 0.0763          |
+| 0.0661        | 1.5682 | 54   | 0.0692          |
+| 0.073         | 1.8409 | 63   | 0.0646          |
+| 0.0625        | 2.0455 | 72   | 0.0621          |
+| 0.0522        | 2.3182 | 81   | 0.0602          |
+| 0.0472        | 2.5909 | 90   | 0.0580          |
+| 0.0545        | 2.8636 | 99   | 0.0571          |
+| 0.0467        | 3.0682 | 108  | 0.0561          |
+| 0.057         | 3.3409 | 117  | 0.0557          |
+| 0.0477        | 3.6136 | 126  | 0.0557          |
 ### Framework versions

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f999cad7e14f1e3ae89430a9cad5ba5d51d1d035a409ac2f6b22a690a6ef6baa
 size 101036698

 version https://git-lfs.github.com/spec/v1
+oid sha256:8807cbd5b63ed7c6a4e11d0030904f68fe580184ce532161badaa79e40a890f9
 size 101036698