End of training
Browse files- README.md +18 -16
- config.json +1 -1
- generation_config.json +1 -1
- model.safetensors +1 -1
- runs/Jun12_18-43-10_688f4ceef715/events.out.tfevents.1718217815.688f4ceef715.355.0 +3 -0
- runs/Jun12_18-44-27_688f4ceef715/events.out.tfevents.1718217874.688f4ceef715.355.1 +3 -0
- runs/Jun12_18-44-51_688f4ceef715/events.out.tfevents.1718217897.688f4ceef715.355.2 +3 -0
- runs/Jun12_18-44-51_688f4ceef715/events.out.tfevents.1718218014.688f4ceef715.355.3 +3 -0
- runs/Jun12_18-47-20_688f4ceef715/events.out.tfevents.1718218048.688f4ceef715.355.4 +3 -0
- runs/Jun12_18-48-11_688f4ceef715/events.out.tfevents.1718218098.688f4ceef715.355.5 +3 -0
- runs/Jun12_18-48-33_688f4ceef715/events.out.tfevents.1718218120.688f4ceef715.355.6 +3 -0
- runs/Jun12_18-53-00_688f4ceef715/events.out.tfevents.1718218390.688f4ceef715.355.7 +3 -0
- runs/Jun12_18-53-47_688f4ceef715/events.out.tfevents.1718218433.688f4ceef715.355.8 +3 -0
- runs/Jun12_19-06-12_688f4ceef715/events.out.tfevents.1718219192.688f4ceef715.355.9 +3 -0
- runs/Jun12_19-06-48_688f4ceef715/events.out.tfevents.1718219214.688f4ceef715.355.10 +3 -0
- runs/Jun12_19-08-47_688f4ceef715/events.out.tfevents.1718219349.688f4ceef715.355.11 +3 -0
- runs/Jun12_19-10-06_688f4ceef715/events.out.tfevents.1718219425.688f4ceef715.355.12 +3 -0
- runs/Jun12_19-10-45_688f4ceef715/events.out.tfevents.1718219467.688f4ceef715.355.13 +3 -0
- training_args.bin +1 -1
README.md
CHANGED
@@ -21,16 +21,16 @@ should probably proofread and complete it, then remove this comment. -->
|
|
21 |
|
22 |
This model is a fine-tuned version of [google-t5/t5-base](https://huggingface.co/google-t5/t5-base) on an unknown dataset.
|
23 |
It achieves the following results on the evaluation set:
|
24 |
-
- Loss: 0.
|
25 |
-
- Rouge1:
|
26 |
-
- Rouge2:
|
27 |
-
- Rougel:
|
28 |
-
- Rougelsum:
|
29 |
-
- Gen Len: 2.
|
30 |
-
- Precision: 0.
|
31 |
-
- Recall: 0.
|
32 |
-
- F1: 0.
|
33 |
-
- Accuracy: 0.
|
34 |
|
35 |
## Model description
|
36 |
|
@@ -55,21 +55,23 @@ The following hyperparameters were used during training:
|
|
55 |
- seed: 42
|
56 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
57 |
- lr_scheduler_type: linear
|
58 |
-
- num_epochs:
|
59 |
- mixed_precision_training: Native AMP
|
60 |
|
61 |
### Training results
|
62 |
|
63 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len | Precision | Recall | F1 | Accuracy |
|
64 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|:---------:|:------:|:------:|:--------:|
|
65 |
-
|
|
66 |
-
|
|
67 |
-
|
|
|
|
|
|
68 |
|
69 |
|
70 |
### Framework versions
|
71 |
|
72 |
-
- Transformers 4.41.
|
73 |
- Pytorch 2.3.0+cu121
|
74 |
-
- Datasets 2.19.
|
75 |
- Tokenizers 0.19.1
|
|
|
21 |
|
22 |
This model is a fine-tuned version of [google-t5/t5-base](https://huggingface.co/google-t5/t5-base) on an unknown dataset.
|
23 |
It achieves the following results on the evaluation set:
|
24 |
+
- Loss: 0.2567
|
25 |
+
- Rouge1: 73.3333
|
26 |
+
- Rouge2: 26.6667
|
27 |
+
- Rougel: 73.3333
|
28 |
+
- Rougelsum: 73.3333
|
29 |
+
- Gen Len: 2.3167
|
30 |
+
- Precision: 0.7333
|
31 |
+
- Recall: 0.7333
|
32 |
+
- F1: 0.7333
|
33 |
+
- Accuracy: 0.7333
|
34 |
|
35 |
## Model description
|
36 |
|
|
|
55 |
- seed: 42
|
56 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
57 |
- lr_scheduler_type: linear
|
58 |
+
- num_epochs: 5
|
59 |
- mixed_precision_training: Native AMP
|
60 |
|
61 |
### Training results
|
62 |
|
63 |
| Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len | Precision | Recall | F1 | Accuracy |
|
64 |
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|:---------:|:------:|:------:|:--------:|
|
65 |
+
| No log | 1.0 | 80 | 0.4117 | 51.6667 | 31.6667 | 51.6667 | 51.6667 | 2.5667 | 0.5167 | 0.5167 | 0.5167 | 0.5167 |
|
66 |
+
| No log | 2.0 | 160 | 0.3237 | 66.6667 | 26.6667 | 66.6667 | 66.6667 | 2.3167 | 0.6667 | 0.6667 | 0.6667 | 0.6667 |
|
67 |
+
| No log | 3.0 | 240 | 0.3069 | 68.3333 | 28.3333 | 68.3333 | 68.3333 | 2.4 | 0.6833 | 0.6833 | 0.6833 | 0.6833 |
|
68 |
+
| No log | 4.0 | 320 | 0.2675 | 73.3333 | 26.6667 | 73.3333 | 73.3333 | 2.3167 | 0.7333 | 0.7333 | 0.7333 | 0.7333 |
|
69 |
+
| No log | 5.0 | 400 | 0.2567 | 73.3333 | 26.6667 | 73.3333 | 73.3333 | 2.3167 | 0.7333 | 0.7333 | 0.7333 | 0.7333 |
|
70 |
|
71 |
|
72 |
### Framework versions
|
73 |
|
74 |
+
- Transformers 4.41.2
|
75 |
- Pytorch 2.3.0+cu121
|
76 |
+
- Datasets 2.19.2
|
77 |
- Tokenizers 0.19.1
|
config.json
CHANGED
@@ -55,7 +55,7 @@
|
|
55 |
}
|
56 |
},
|
57 |
"torch_dtype": "float32",
|
58 |
-
"transformers_version": "4.41.
|
59 |
"use_cache": true,
|
60 |
"vocab_size": 32128
|
61 |
}
|
|
|
55 |
}
|
56 |
},
|
57 |
"torch_dtype": "float32",
|
58 |
+
"transformers_version": "4.41.2",
|
59 |
"use_cache": true,
|
60 |
"vocab_size": 32128
|
61 |
}
|
generation_config.json
CHANGED
@@ -2,5 +2,5 @@
|
|
2 |
"decoder_start_token_id": 0,
|
3 |
"eos_token_id": 1,
|
4 |
"pad_token_id": 0,
|
5 |
-
"transformers_version": "4.41.
|
6 |
}
|
|
|
2 |
"decoder_start_token_id": 0,
|
3 |
"eos_token_id": 1,
|
4 |
"pad_token_id": 0,
|
5 |
+
"transformers_version": "4.41.2"
|
6 |
}
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 891644712
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e278cd1d44d39a52d34a28a7c0be6263041fac526637cc4831135e44420c2101
|
3 |
size 891644712
|
runs/Jun12_18-43-10_688f4ceef715/events.out.tfevents.1718217815.688f4ceef715.355.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:42aba1444f830a722e71e35473d1aa9b35527b7891334e171016ec491ed2b9c1
|
3 |
+
size 5835
|
runs/Jun12_18-44-27_688f4ceef715/events.out.tfevents.1718217874.688f4ceef715.355.1
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f1bdec938dc7336c8c7a1f8d71cb7d3b8cc7eb728ede4a4bd3940d6d3d06e3f8
|
3 |
+
size 5833
|
runs/Jun12_18-44-51_688f4ceef715/events.out.tfevents.1718217897.688f4ceef715.355.2
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:33839cc9051564f1ac6a56869329a6b214b720dbb15fa8e51993972161f703d9
|
3 |
+
size 5833
|
runs/Jun12_18-44-51_688f4ceef715/events.out.tfevents.1718218014.688f4ceef715.355.3
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c54a8341c91c55e0d14459db3e55b8ab16732085f311453de893947f3a443c11
|
3 |
+
size 5833
|
runs/Jun12_18-47-20_688f4ceef715/events.out.tfevents.1718218048.688f4ceef715.355.4
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:706b39499fe3c3121bd5a61fff19af6be4bb7cedab08779ce35e45235060b771
|
3 |
+
size 5833
|
runs/Jun12_18-48-11_688f4ceef715/events.out.tfevents.1718218098.688f4ceef715.355.5
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0436ddb5deddf041307da2bcb8787aa357ec2ac70515a4f9c4675722f7b4eedc
|
3 |
+
size 5833
|
runs/Jun12_18-48-33_688f4ceef715/events.out.tfevents.1718218120.688f4ceef715.355.6
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f0a59b7a2439ce791c661a130661e2ba5bda316b3cbcbbd3290ebf718cec0aae
|
3 |
+
size 5833
|
runs/Jun12_18-53-00_688f4ceef715/events.out.tfevents.1718218390.688f4ceef715.355.7
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:acfdc30302eee0e37aee72a8972e3bf255c3354c090aa372cdc84dfd3c9ed129
|
3 |
+
size 5833
|
runs/Jun12_18-53-47_688f4ceef715/events.out.tfevents.1718218433.688f4ceef715.355.8
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b073bac1c548c1e0b4e37754fdf20473b47fab4664f8c43b5f9e170a7faaccc0
|
3 |
+
size 5833
|
runs/Jun12_19-06-12_688f4ceef715/events.out.tfevents.1718219192.688f4ceef715.355.9
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5eb50651fd0f5a05055e30ec1918e4cbe6d30163d26956b271cffa02b7f8bc45
|
3 |
+
size 5833
|
runs/Jun12_19-06-48_688f4ceef715/events.out.tfevents.1718219214.688f4ceef715.355.10
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:eb9cd44862249c3f39154f83755c4edd67bb56bd38d884d35b7ea983b2a04eca
|
3 |
+
size 5833
|
runs/Jun12_19-08-47_688f4ceef715/events.out.tfevents.1718219349.688f4ceef715.355.11
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5d5dc0623283882ff1dfb25beb117839272ac3ee9cb189114c0fd56f144c9146
|
3 |
+
size 5833
|
runs/Jun12_19-10-06_688f4ceef715/events.out.tfevents.1718219425.688f4ceef715.355.12
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:424951c9ec7520bb39e41fa8eef1a7679320c227486f43f911491aecf48dced4
|
3 |
+
size 5833
|
runs/Jun12_19-10-45_688f4ceef715/events.out.tfevents.1718219467.688f4ceef715.355.13
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5ed1e0b486fb85793fe381446fcc30a06c070a5a1f9c7ae6af573de6f3a6af54
|
3 |
+
size 9803
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 5240
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5ee04ecb93a666e2d0f67c89c07fba8210fec11746dff43232eb071348c83642
|
3 |
size 5240
|