CodeIsAbstract
/

denoice-finetuned-xsum

@@ -17,12 +17,12 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [google/t5-efficient-tiny](https://huggingface.co/google/t5-efficient-tiny) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.7699
-- Rouge1: 75.4057
-- Rouge2: 60.8372
-- Rougel: 75.2813
-- Rougelsum: 75.3388
-- Gen Len: 17.5157
 ## Model description
@@ -47,37 +47,17 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 25
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
-| No log        | 1.0   | 76   | 0.8420          | 72.7754 | 57.4787 | 72.7926 | 72.73     | 17.6387 |
-| No log        | 2.0   | 152  | 0.8339          | 73.0721 | 57.2706 | 73.0236 | 72.9623   | 17.6178 |
-| No log        | 3.0   | 228  | 0.8351          | 73.6099 | 57.967  | 73.5666 | 73.4951   | 17.6623 |
-| No log        | 4.0   | 304  | 0.8239          | 73.5357 | 57.6669 | 73.4526 | 73.5157   | 17.5916 |
-| No log        | 5.0   | 380  | 0.8164          | 73.7228 | 58.0556 | 73.6791 | 73.69     | 17.5969 |
-| No log        | 6.0   | 456  | 0.8133          | 74.1346 | 58.755  | 74.0915 | 74.1094   | 17.6021 |
-| 1.1888        | 7.0   | 532  | 0.8080          | 74.2063 | 58.7052 | 74.1442 | 74.2247   | 17.5969 |
-| 1.1888        | 8.0   | 608  | 0.8073          | 74.0301 | 58.5081 | 73.9849 | 74.0378   | 17.5733 |
-| 1.1888        | 9.0   | 684  | 0.8043          | 74.2651 | 58.8463 | 74.2069 | 74.284    | 17.5785 |
-| 1.1888        | 10.0  | 760  | 0.8003          | 74.4637 | 59.2146 | 74.4277 | 74.5006   | 17.5838 |
-| 1.1888        | 11.0  | 836  | 0.7933          | 74.5048 | 59.1095 | 74.4329 | 74.4965   | 17.5707 |
-| 1.1888        | 12.0  | 912  | 0.7904          | 74.8337 | 59.4889 | 74.7667 | 74.8005   | 17.5733 |
-| 1.1888        | 13.0  | 988  | 0.7879          | 75.0519 | 59.7797 | 74.937  | 75.0109   | 17.5785 |
-| 1.1259        | 14.0  | 1064 | 0.7876          | 75.2269 | 60.0994 | 75.1242 | 75.1981   | 17.5628 |
-| 1.1259        | 15.0  | 1140 | 0.7779          | 75.1229 | 59.887  | 75.0029 | 75.0717   | 17.5445 |
-| 1.1259        | 16.0  | 1216 | 0.7768          | 75.461  | 60.4297 | 75.255  | 75.3054   | 17.5393 |
-| 1.1259        | 17.0  | 1292 | 0.7757          | 75.5466 | 60.7191 | 75.397  | 75.4456   | 17.5497 |
-| 1.1259        | 18.0  | 1368 | 0.7780          | 75.6079 | 60.7743 | 75.4396 | 75.5341   | 17.5497 |
-| 1.1259        | 19.0  | 1444 | 0.7730          | 75.362  | 60.6829 | 75.1996 | 75.2653   | 17.5393 |
-| 1.0939        | 20.0  | 1520 | 0.7739          | 75.4207 | 60.7544 | 75.2581 | 75.3047   | 17.5393 |
-| 1.0939        | 21.0  | 1596 | 0.7730          | 75.363  | 60.7235 | 75.2341 | 75.2882   | 17.5471 |
-| 1.0939        | 22.0  | 1672 | 0.7713          | 75.3809 | 60.7759 | 75.2489 | 75.3012   | 17.5262 |
-| 1.0939        | 23.0  | 1748 | 0.7695          | 75.4104 | 60.7961 | 75.2757 | 75.3183   | 17.5157 |
-| 1.0939        | 24.0  | 1824 | 0.7705          | 75.3482 | 60.8028 | 75.2489 | 75.2857   | 17.5157 |
-| 1.0939        | 25.0  | 1900 | 0.7699          | 75.4057 | 60.8372 | 75.2813 | 75.3388   | 17.5157 |
 ### Framework versions

 This model is a fine-tuned version of [google/t5-efficient-tiny](https://huggingface.co/google/t5-efficient-tiny) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.7541
+- Rouge1: 76.26
+- Rouge2: 61.8085
+- Rougel: 76.1635
+- Rougelsum: 76.1928
+- Gen Len: 17.4843
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Rouge1  | Rouge2  | Rougel  | Rougelsum | Gen Len |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|:-------:|:---------:|:-------:|
+| No log        | 1.0   | 76   | 0.7604          | 75.9742 | 61.5113 | 75.8301 | 75.8438   | 17.4817 |
+| No log        | 2.0   | 152  | 0.7574          | 75.9172 | 61.56   | 75.7901 | 75.8489   | 17.4817 |
+| No log        | 3.0   | 228  | 0.7568          | 76.382  | 61.5593 | 76.1883 | 76.2735   | 17.4791 |
+| No log        | 4.0   | 304  | 0.7565          | 76.3074 | 61.8211 | 76.1848 | 76.2148   | 17.4843 |
+| No log        | 5.0   | 380  | 0.7541          | 76.26   | 61.8085 | 76.1635 | 76.1928   | 17.4843 |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:7683fb024ee9fff85c5557d5cd1fd60ecc0f198a4049e3a079eecc4941c3d398
 size 62293080

 version https://git-lfs.github.com/spec/v1
+oid sha256:a929b5241809bd9494bc24d8a7137029e559bdb1740b880225fda85e7a6d908d
 size 62293080

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:bb76e5009502d132e438e5be5d8f8e3baf15910c30dffd5bf02d1005cdb83d9b
 size 4411

 version https://git-lfs.github.com/spec/v1
+oid sha256:df581ca3c17d869459b700204192cd2b8ab5a9e278b96143fc345268ffff25e3
 size 4411