hiba2 committed
Commit d84e1b5 · verified · 1 Parent(s): 755ed4d

End of training

Files changed (3)
  1. README.md +17 -17
  2. model.safetensors +1 -1
  3. training_args.bin +1 -1
README.md CHANGED
@@ -16,12 +16,12 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [malmarjeh/t5-arabic-text-summarization](https://huggingface.co/malmarjeh/t5-arabic-text-summarization) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.0348
- - Rouge1: 0.1242
- - Rouge2: 0.0117
- - Rougel: 0.1244
- - Rougelsum: 0.1241
- - Gen Len: 6.9278
+ - Loss: 0.0104
+ - Rouge1: 0.1382
+ - Rouge2: 0.0187
+ - Rougel: 0.1382
+ - Rougelsum: 0.1382
+ - Gen Len: 18.9404

  ## Model description

@@ -40,33 +40,33 @@ More information needed
  ### Training hyperparameters

  The following hyperparameters were used during training:
- - learning_rate: 2e-05
+ - learning_rate: 0.0005
  - train_batch_size: 1
  - eval_batch_size: 1
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - lr_scheduler_warmup_steps: 100
- - num_epochs: 2
+ - num_epochs: 5
  - mixed_precision_training: Native AMP

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:-------:|
- | 0.6242 | 0.23 | 500 | 0.0883 | 0.1237 | 0.0117 | 0.1239 | 0.1237 | 5.1787 |
- | 0.5832 | 0.46 | 1000 | 0.0658 | 0.1237 | 0.0117 | 0.1239 | 0.1237 | 5.1787 |
- | 0.5007 | 0.7 | 1500 | 0.0554 | 0.1237 | 0.0117 | 0.1239 | 0.1237 | 5.1787 |
- | 0.4419 | 0.93 | 2000 | 0.0490 | 0.1237 | 0.0117 | 0.1239 | 0.1237 | 6.0018 |
- | 0.3982 | 1.16 | 2500 | 0.0440 | 0.1237 | 0.0117 | 0.1239 | 0.1237 | 5.6931 |
- | 0.3671 | 1.39 | 3000 | 0.0383 | 0.1238 | 0.0117 | 0.1239 | 0.1238 | 5.6588 |
- | 0.3509 | 1.62 | 3500 | 0.0360 | 0.1242 | 0.0117 | 0.1244 | 0.1241 | 6.8249 |
- | 0.3332 | 1.86 | 4000 | 0.0348 | 0.1242 | 0.0117 | 0.1244 | 0.1241 | 6.9278 |
+ | 0.0338 | 0.23 | 500 | 0.0175 | 0.1514 | 0.0297 | 0.1511 | 0.1518 | 18.9188 |
+ | 0.0566 | 0.46 | 1000 | 0.0161 | 0.1565 | 0.0388 | 0.157 | 0.1573 | 18.9188 |
+ | 0.0418 | 0.7 | 1500 | 0.0125 | 0.1372 | 0.0199 | 0.1375 | 0.1379 | 18.8105 |
+ | 0.0333 | 0.93 | 2000 | 0.0116 | 0.1443 | 0.0253 | 0.1448 | 0.1448 | 18.8051 |
+ | 0.0287 | 1.16 | 2500 | 0.0110 | 0.144 | 0.0192 | 0.1442 | 0.1442 | 19.0 |
+ | 0.0247 | 1.39 | 3000 | 0.0096 | 0.1511 | 0.024 | 0.1517 | 0.1518 | 19.0 |
+ | 0.0219 | 1.62 | 3500 | 0.0087 | 0.1463 | 0.0241 | 0.1462 | 0.1462 | 18.9747 |
+ | 0.021 | 1.86 | 4000 | 0.0104 | 0.1382 | 0.0187 | 0.1382 | 0.1382 | 18.9404 |


  ### Framework versions

  - Transformers 4.39.0.dev0
- - Pytorch 2.1.0+cu121
+ - Pytorch 2.2.1+cu121
  - Datasets 2.18.0
  - Tokenizers 0.15.2
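
The README diff above raises `learning_rate` from 2e-05 to 0.0005 while keeping `lr_scheduler_type: linear` and `lr_scheduler_warmup_steps: 100`. As a rough illustration of what that combination means per step, here is a minimal pure-Python sketch of a linear warmup followed by linear decay to zero. The function name `linear_schedule_lr` and the `total_steps` default of 10,000 are assumptions made for the example; the commit does not state the total number of optimizer steps.

```python
def linear_schedule_lr(step, base_lr=5e-4, warmup_steps=100, total_steps=10_000):
    """Learning rate at `step` under linear warmup, then linear decay to zero.

    base_lr and warmup_steps come from the README diff; total_steps is a
    hypothetical value chosen only for illustration.
    """
    if step < warmup_steps:
        # Warmup: ramp linearly from 0 up to base_lr.
        return base_lr * step / warmup_steps
    # Decay: fall linearly from base_lr at the end of warmup to 0 at total_steps.
    remaining = max(0, total_steps - step)
    return base_lr * remaining / max(1, total_steps - warmup_steps)


if __name__ == "__main__":
    # Steps logged in the training results table, plus the warmup boundary.
    for step in (0, 50, 100, 500, 4000):
        print(step, linear_schedule_lr(step))
```

With these settings the peak rate is reached after only 100 steps, so every row in the training results table (logged from step 500 onward) falls in the decay phase.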
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8d4fd098b0b4b9fae8065e559805f7cf706e29cdfd39754f0acf1cbfe759acc6
+ oid sha256:85b19a78ac36c127022787c06979316034e323ee6bdbe6c5711bf15576649e48
  size 1131116304
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:30e67af8cb9557cb8ec34c7f431c3d3bbb0da6d6138b61d0bb80257815b13681
+ oid sha256:19f9e9c5d0720a58f78acdecf0f30916dd13d8dbbc2353309c35f535efad14fb
  size 4984
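
Both binary files are stored as Git LFS pointers, so the diff shows only a changed `oid` (the SHA-256 of the file contents) and an unchanged `size` in bytes. The sketch below shows one way a downloaded file could be checked against such a pointer. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the example hashes an in-memory byte string for brevity; a file as large as `model.safetensors` would more sensibly be hashed in chunks.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form '<algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "oid": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Return True if `data` has the size and digest recorded in the pointer."""
    if len(data) != pointer["size"]:
        return False
    return hashlib.new(pointer["algo"], data).hexdigest() == pointer["oid"]


if __name__ == "__main__":
    # New pointer for model.safetensors, copied from the diff above.
    pointer_text = """\
version https://git-lfs.github.com/spec/v1
oid sha256:85b19a78ac36c127022787c06979316034e323ee6bdbe6c5711bf15576649e48
size 1131116304
"""
    print(parse_lfs_pointer(pointer_text))
```

Because the size is identical before and after the commit, the `oid` is the only field that shows the weights actually changed.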