End of training

Files changed (5) hide show

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [DeepESP/gpt2-spanish](https://huggingface.co/DeepESP/gpt2-spanish) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.7762
 ## Model description
@@ -34,28 +34,33 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 2e-05
-- train_batch_size: 24
-- eval_batch_size: 24
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 4.5281        | 1.0   | 14   | 4.2174          |
-| 4.0533        | 2.0   | 28   | 3.9744          |
-| 3.789         | 3.0   | 42   | 3.8885          |
-| 3.6531        | 4.0   | 56   | 3.8412          |
-| 3.4773        | 5.0   | 70   | 3.8131          |
-| 3.3368        | 6.0   | 84   | 3.7965          |
-| 3.232         | 7.0   | 98   | 3.7848          |
-| 3.185         | 8.0   | 112  | 3.7803          |
-| 3.1228        | 9.0   | 126  | 3.7775          |
-| 3.1037        | 10.0  | 140  | 3.7762          |
 ### Framework versions

 This model is a fine-tuned version of [DeepESP/gpt2-spanish](https://huggingface.co/DeepESP/gpt2-spanish) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.9312
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1e-05
+- train_batch_size: 64
+- eval_batch_size: 64
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 15
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 5.7424        | 1.0   | 5    | 5.0220          |
+| 4.7253        | 2.0   | 10   | 4.5156          |
+| 4.4643        | 3.0   | 15   | 4.3808          |
+| 4.3235        | 4.0   | 20   | 4.2740          |
+| 4.2015        | 5.0   | 25   | 4.1731          |
+| 4.0779        | 6.0   | 30   | 4.0667          |
+| 3.9722        | 7.0   | 35   | 4.0160          |
+| 3.9136        | 8.0   | 40   | 3.9975          |
+| 3.878         | 9.0   | 45   | 3.9820          |
+| 3.8465        | 10.0  | 50   | 3.9675          |
+| 3.8029        | 11.0  | 55   | 3.9552          |
+| 3.7845        | 12.0  | 60   | 3.9454          |
+| 3.7639        | 13.0  | 65   | 3.9383          |
+| 3.7473        | 14.0  | 70   | 3.9337          |
+| 3.7358        | 15.0  | 75   | 3.9312          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d07f09c2fb78aa521581c50d06b1ca0006718a534769fb0a788dfc9c360d296f
 size 497774208

 version https://git-lfs.github.com/spec/v1
+oid sha256:f0aee331a1df0a3bcb89109fcbda26ebf3bee40b0a5f6293f5ddd9e29a446d14
 size 497774208

runs/Nov25_15-05-07_b6f9b399bc40/events.out.tfevents.1700924716.b6f9b399bc40.1277.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:103df4bad2e143a7f3c7b35248c8e28ebda03612b8af22164182778dd53e6932
+size 13482

runs/Nov25_15-06-14_b6f9b399bc40/events.out.tfevents.1700924781.b6f9b399bc40.1277.2 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:5550684a4983cb1a56da51737f88976c134e8195be0a2b4146193a4f4461a9d9
+size 11136

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3848e80dbfa5c25ee3064ee94d9ffa7b0c31173c2308724d0662bec05bf51f81
 size 4536

 version https://git-lfs.github.com/spec/v1
+oid sha256:d6fa55a7ae9979835533aa63f6064de1ebcaeb974fb2ac365541e3d7b0a19dc8
 size 4536