HectorWoods42
/

t5-base-finetuned-xsum

Transformers

PyTorch

text2text-generation

Generated from Trainer

text-generation-inference

Model card Files Files and versions Community

HectorWoods42 commited on Sep 12, 2023

Commit

766a063

1 Parent(s): 6f31b14

End of training

Browse files

Files changed (3) hide show

README.md +106 -0
generation_config.json +7 -0
pytorch_model.bin +1 -1

README.md ADDED Viewed

	@@ -0,0 +1,106 @@

+---
+license: apache-2.0
+base_model: t5-base
+tags:
+- generated_from_trainer
+model-index:
+- name: t5-base-finetuned-xsum
+  results: []
+---
+<!-- This model card has been generated automatically according to the information the Trainer had access to. You
+should probably proofread and complete it, then remove this comment. -->
+# t5-base-finetuned-xsum
+This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on the None dataset.
+It achieves the following results on the evaluation set:
+- Loss: 2.0457
+## Model description
+More information needed
+## Intended uses & limitations
+More information needed
+## Training and evaluation data
+More information needed
+## Training procedure
+### Training hyperparameters
+The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 2
+- eval_batch_size: 2
+- seed: 42
+- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+- lr_scheduler_type: linear
+- num_epochs: 50
+### Training results
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 64   | 2.6897          |
+| No log        | 2.0   | 128  | 1.8762          |
+| No log        | 3.0   | 192  | 1.7924          |
+| No log        | 4.0   | 256  | 1.7568          |
+| No log        | 5.0   | 320  | 1.7372          |
+| No log        | 6.0   | 384  | 1.7117          |
+| No log        | 7.0   | 448  | 1.7177          |
+| 1.6685        | 8.0   | 512  | 1.7206          |
+| 1.6685        | 9.0   | 576  | 1.7204          |
+| 1.6685        | 10.0  | 640  | 1.7308          |
+| 1.6685        | 11.0  | 704  | 1.7393          |
+| 1.6685        | 12.0  | 768  | 1.7525          |
+| 1.6685        | 13.0  | 832  | 1.7592          |
+| 1.6685        | 14.0  | 896  | 1.7706          |
+| 1.6685        | 15.0  | 960  | 1.7746          |
+| 0.9956        | 16.0  | 1024 | 1.7864          |
+| 0.9956        | 17.0  | 1088 | 1.7997          |
+| 0.9956        | 18.0  | 1152 | 1.8177          |
+| 0.9956        | 19.0  | 1216 | 1.8332          |
+| 0.9956        | 20.0  | 1280 | 1.8388          |
+| 0.9956        | 21.0  | 1344 | 1.8472          |
+| 0.9956        | 22.0  | 1408 | 1.8600          |
+| 0.9956        | 23.0  | 1472 | 1.8693          |
+| 0.7826        | 24.0  | 1536 | 1.8867          |
+| 0.7826        | 25.0  | 1600 | 1.9002          |
+| 0.7826        | 26.0  | 1664 | 1.9121          |
+| 0.7826        | 27.0  | 1728 | 1.9221          |
+| 0.7826        | 28.0  | 1792 | 1.9273          |
+| 0.7826        | 29.0  | 1856 | 1.9381          |
+| 0.7826        | 30.0  | 1920 | 1.9509          |
+| 0.7826        | 31.0  | 1984 | 1.9587          |
+| 0.6385        | 32.0  | 2048 | 1.9644          |
+| 0.6385        | 33.0  | 2112 | 1.9657          |
+| 0.6385        | 34.0  | 2176 | 1.9783          |
+| 0.6385        | 35.0  | 2240 | 1.9870          |
+| 0.6385        | 36.0  | 2304 | 1.9911          |
+| 0.6385        | 37.0  | 2368 | 1.9959          |
+| 0.6385        | 38.0  | 2432 | 2.0009          |
+| 0.6385        | 39.0  | 2496 | 2.0091          |
+| 0.5653        | 40.0  | 2560 | 2.0190          |
+| 0.5653        | 41.0  | 2624 | 2.0195          |
+| 0.5653        | 42.0  | 2688 | 2.0240          |
+| 0.5653        | 43.0  | 2752 | 2.0308          |
+| 0.5653        | 44.0  | 2816 | 2.0340          |
+| 0.5653        | 45.0  | 2880 | 2.0364          |
+| 0.5653        | 46.0  | 2944 | 2.0414          |
+| 0.5251        | 47.0  | 3008 | 2.0427          |
+| 0.5251        | 48.0  | 3072 | 2.0447          |
+| 0.5251        | 49.0  | 3136 | 2.0452          |
+| 0.5251        | 50.0  | 3200 | 2.0457          |
+### Framework versions
+- Transformers 4.33.0
+- Pytorch 2.0.0
+- Datasets 2.1.0
+- Tokenizers 0.13.3

generation_config.json ADDED Viewed

	@@ -0,0 +1,7 @@

+{
+  "_from_model_config": true,
+  "decoder_start_token_id": 0,
+  "eos_token_id": 1,
+  "pad_token_id": 0,
+  "transformers_version": "4.33.0"
+}

pytorch_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:5a1df65e4038c9e30c2bd0e685f468312186fa87bf77b6b2f5c6a6a8c215f756
 size 891702929

 version https://git-lfs.github.com/spec/v1
+oid sha256:e14945cf6183d2346f9ab7fa01409d03b914f273c3fd99ad8905c4d645abb834
 size 891702929