welsachy commited on Jun 12, 2024

Commit

66e0e46

verified ·

1 Parent(s): 9ce5d7b

End of training

Browse files

Files changed (19) hide show

README.md +18 -16
config.json +1 -1
generation_config.json +1 -1
model.safetensors +1 -1
runs/Jun12_18-43-10_688f4ceef715/events.out.tfevents.1718217815.688f4ceef715.355.0 +3 -0
runs/Jun12_18-44-27_688f4ceef715/events.out.tfevents.1718217874.688f4ceef715.355.1 +3 -0
runs/Jun12_18-44-51_688f4ceef715/events.out.tfevents.1718217897.688f4ceef715.355.2 +3 -0
runs/Jun12_18-44-51_688f4ceef715/events.out.tfevents.1718218014.688f4ceef715.355.3 +3 -0
runs/Jun12_18-47-20_688f4ceef715/events.out.tfevents.1718218048.688f4ceef715.355.4 +3 -0
runs/Jun12_18-48-11_688f4ceef715/events.out.tfevents.1718218098.688f4ceef715.355.5 +3 -0
runs/Jun12_18-48-33_688f4ceef715/events.out.tfevents.1718218120.688f4ceef715.355.6 +3 -0
runs/Jun12_18-53-00_688f4ceef715/events.out.tfevents.1718218390.688f4ceef715.355.7 +3 -0
runs/Jun12_18-53-47_688f4ceef715/events.out.tfevents.1718218433.688f4ceef715.355.8 +3 -0
runs/Jun12_19-06-12_688f4ceef715/events.out.tfevents.1718219192.688f4ceef715.355.9 +3 -0
runs/Jun12_19-06-48_688f4ceef715/events.out.tfevents.1718219214.688f4ceef715.355.10 +3 -0
runs/Jun12_19-08-47_688f4ceef715/events.out.tfevents.1718219349.688f4ceef715.355.11 +3 -0
runs/Jun12_19-10-06_688f4ceef715/events.out.tfevents.1718219425.688f4ceef715.355.12 +3 -0
runs/Jun12_19-10-45_688f4ceef715/events.out.tfevents.1718219467.688f4ceef715.355.13 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -21,16 +21,16 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google-t5/t5-base](https://huggingface.co/google-t5/t5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.3483
-- Rouge1: 67.6667
-- Rouge2: 21.8889
-- Rougel: 67.6667
-- Rougelsum: 67.6667
-- Gen Len: 2.2944
-- Precision: 0.6767
-- Recall: 0.6767
-- F1: 0.6767
-- Accuracy: 0.6767
 ## Model description
@@ -55,21 +55,23 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|:---------:|:------:|:------:|:--------:|
-| 0.3908        | 1.0   | 1199 | 0.3346          | 67.1111 | 20.0    | 67.3333 | 67.2222   | 2.2578  | 0.6722    | 0.6722 | 0.6722 | 0.6722   |
-| 0.3601        | 2.0   | 2398 | 0.3582          | 67.4444 | 18.5556 | 67.5556 | 67.6667   | 2.2256  | 0.6756    | 0.6756 | 0.6756 | 0.6756   |
-| 0.3289        | 3.0   | 3597 | 0.3483          | 67.6667 | 21.8889 | 67.6667 | 67.6667   | 2.2944  | 0.6767    | 0.6767 | 0.6767 | 0.6767   |
 ### Framework versions
-- Transformers 4.41.1
 - Pytorch 2.3.0+cu121
-- Datasets 2.19.1
 - Tokenizers 0.19.1

 This model is a fine-tuned version of [google-t5/t5-base](https://huggingface.co/google-t5/t5-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.2567
+- Rouge1: 73.3333
+- Rouge2: 26.6667
+- Rougel: 73.3333
+- Rougelsum: 73.3333
+- Gen Len: 2.3167
+- Precision: 0.7333
+- Recall: 0.7333
+- F1: 0.7333
+- Accuracy: 0.7333
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|:---------:|:------:|:------:|:--------:|
+| No log        | 1.0   | 80   | 0.4117          | 51.6667 | 31.6667 | 51.6667 | 51.6667   | 2.5667  | 0.5167    | 0.5167 | 0.5167 | 0.5167   |
+| No log        | 2.0   | 160  | 0.3237          | 66.6667 | 26.6667 | 66.6667 | 66.6667   | 2.3167  | 0.6667    | 0.6667 | 0.6667 | 0.6667   |
+| No log        | 3.0   | 240  | 0.3069          | 68.3333 | 28.3333 | 68.3333 | 68.3333   | 2.4     | 0.6833    | 0.6833 | 0.6833 | 0.6833   |
+| No log        | 4.0   | 320  | 0.2675          | 73.3333 | 26.6667 | 73.3333 | 73.3333   | 2.3167  | 0.7333    | 0.7333 | 0.7333 | 0.7333   |
+| No log        | 5.0   | 400  | 0.2567          | 73.3333 | 26.6667 | 73.3333 | 73.3333   | 2.3167  | 0.7333    | 0.7333 | 0.7333 | 0.7333   |
 ### Framework versions
+- Transformers 4.41.2
 - Pytorch 2.3.0+cu121
+- Datasets 2.19.2
 - Tokenizers 0.19.1

config.json CHANGED Viewed

@@ -55,7 +55,7 @@
     }
   },
   "torch_dtype": "float32",
-  "transformers_version": "4.41.1",
   "use_cache": true,
   "vocab_size": 32128
 }

     }
   },
   "torch_dtype": "float32",
+  "transformers_version": "4.41.2",
   "use_cache": true,
   "vocab_size": 32128
 }

generation_config.json CHANGED Viewed

@@ -2,5 +2,5 @@
   "decoder_start_token_id": 0,
   "eos_token_id": 1,
   "pad_token_id": 0,
-  "transformers_version": "4.41.1"
 }

   "decoder_start_token_id": 0,
   "eos_token_id": 1,
   "pad_token_id": 0,
+  "transformers_version": "4.41.2"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1842317db081b622577105e96e1ed0896c90f546adb676e31b30ea982c641941
 size 891644712

 version https://git-lfs.github.com/spec/v1
+oid sha256:e278cd1d44d39a52d34a28a7c0be6263041fac526637cc4831135e44420c2101
 size 891644712

runs/Jun12_18-43-10_688f4ceef715/events.out.tfevents.1718217815.688f4ceef715.355.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:42aba1444f830a722e71e35473d1aa9b35527b7891334e171016ec491ed2b9c1
+size 5835

runs/Jun12_18-44-27_688f4ceef715/events.out.tfevents.1718217874.688f4ceef715.355.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f1bdec938dc7336c8c7a1f8d71cb7d3b8cc7eb728ede4a4bd3940d6d3d06e3f8
+size 5833

runs/Jun12_18-44-51_688f4ceef715/events.out.tfevents.1718217897.688f4ceef715.355.2 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:33839cc9051564f1ac6a56869329a6b214b720dbb15fa8e51993972161f703d9
+size 5833

runs/Jun12_18-44-51_688f4ceef715/events.out.tfevents.1718218014.688f4ceef715.355.3 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c54a8341c91c55e0d14459db3e55b8ab16732085f311453de893947f3a443c11
+size 5833

runs/Jun12_18-47-20_688f4ceef715/events.out.tfevents.1718218048.688f4ceef715.355.4 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:706b39499fe3c3121bd5a61fff19af6be4bb7cedab08779ce35e45235060b771
+size 5833

runs/Jun12_18-48-11_688f4ceef715/events.out.tfevents.1718218098.688f4ceef715.355.5 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:0436ddb5deddf041307da2bcb8787aa357ec2ac70515a4f9c4675722f7b4eedc
+size 5833

runs/Jun12_18-48-33_688f4ceef715/events.out.tfevents.1718218120.688f4ceef715.355.6 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:f0a59b7a2439ce791c661a130661e2ba5bda316b3cbcbbd3290ebf718cec0aae
+size 5833

runs/Jun12_18-53-00_688f4ceef715/events.out.tfevents.1718218390.688f4ceef715.355.7 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:acfdc30302eee0e37aee72a8972e3bf255c3354c090aa372cdc84dfd3c9ed129
+size 5833

runs/Jun12_18-53-47_688f4ceef715/events.out.tfevents.1718218433.688f4ceef715.355.8 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b073bac1c548c1e0b4e37754fdf20473b47fab4664f8c43b5f9e170a7faaccc0
+size 5833

runs/Jun12_19-06-12_688f4ceef715/events.out.tfevents.1718219192.688f4ceef715.355.9 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5eb50651fd0f5a05055e30ec1918e4cbe6d30163d26956b271cffa02b7f8bc45
+size 5833

runs/Jun12_19-06-48_688f4ceef715/events.out.tfevents.1718219214.688f4ceef715.355.10 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:eb9cd44862249c3f39154f83755c4edd67bb56bd38d884d35b7ea983b2a04eca
+size 5833

runs/Jun12_19-08-47_688f4ceef715/events.out.tfevents.1718219349.688f4ceef715.355.11 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5d5dc0623283882ff1dfb25beb117839272ac3ee9cb189114c0fd56f144c9146
+size 5833

runs/Jun12_19-10-06_688f4ceef715/events.out.tfevents.1718219425.688f4ceef715.355.12 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:424951c9ec7520bb39e41fa8eef1a7679320c227486f43f911491aecf48dced4
+size 5833

runs/Jun12_19-10-45_688f4ceef715/events.out.tfevents.1718219467.688f4ceef715.355.13 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5ed1e0b486fb85793fe381446fcc30a06c070a5a1f9c7ae6af573de6f3a6af54
+size 9803

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4dc04e813518615458cedf4cafc2a271356882a89abab9b41181f5105ba69025
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:5ee04ecb93a666e2d0f67c89c07fba8210fec11746dff43232eb071348c83642
 size 5240