End of training

Files changed (4) hide show

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [moussaKam/AraBART](https://huggingface.co/moussaKam/AraBART) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5748
 ## Model description
@@ -36,13 +36,13 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
@@ -56,6 +56,16 @@ The following hyperparameters were used during training:
 | 0.6767        | 2.1313 | 2500 | 0.5775          |
 | 0.6561        | 2.5575 | 3000 | 0.5758          |
 | 0.6562        | 2.9838 | 3500 | 0.5748          |
 ### Framework versions

 This model is a fine-tuned version of [moussaKam/AraBART](https://huggingface.co/moussaKam/AraBART) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.5783
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 128
+- eval_batch_size: 128
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 8
 - mixed_precision_training: Native AMP
 ### Training results
 | 0.6767        | 2.1313 | 2500 | 0.5775          |
 | 0.6561        | 2.5575 | 3000 | 0.5758          |
 | 0.6562        | 2.9838 | 3500 | 0.5748          |
+| 0.6457        | 3.4101 | 4000 | 0.5774          |
+| 0.6519        | 3.8363 | 4500 | 0.5755          |
+| 0.6396        | 4.2626 | 5000 | 0.5774          |
+| 0.626         | 4.6888 | 5500 | 0.5773          |
+| 0.6201        | 5.1151 | 6000 | 0.5789          |
+| 0.605         | 5.5413 | 6500 | 0.5776          |
+| 0.6044        | 5.9676 | 7000 | 0.5770          |
+| 0.5899        | 6.3939 | 7500 | 0.5786          |
+| 0.5917        | 6.8201 | 8000 | 0.5779          |
+| 0.5913        | 7.2464 | 8500 | 0.5783          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:d8dae86a034860d617b03ea1a76145a906f31adf2bcf2a1ad1aad5e02b328ab1
 size 557116312

 version https://git-lfs.github.com/spec/v1
+oid sha256:c64406554d8e427efacda687bef2b1b22264c6e153816e5349010b99b5206139
 size 557116312

runs/Sep03_16-08-27_423de3f4d3a8/events.out.tfevents.1725379718.423de3f4d3a8.820.5 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:dd49772425cd4630f980af78036ab03820e1cafd674d4755b3227e1f88e54f1b
+size 10973

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b5cecf1c5608619564d2f95e657d680092c588bd12b1ea1aed4f2d4f0378cd1f
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:6ba230dececc72847b4fd51791627edea2b495b52f70c3b44881aa43d66e7bb0
 size 5240