README - long-t5-tglobal-base-16384-booksum-V11-big_patent-V2
- this README was added because there wasn't one
- created 2022-07-31_12-14-50
about
An experiment in transfer learning: starting from pszemraj/long-t5-tglobal-base-16384-book-summary, the model is fine-tuned on the big_patent dataset on Hugging Face to evaluate how well it can learn technical documentation.
This checkpoint was trained on subsection y of big_patent for approximately 400 steps at a functional batch size of 128.
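For readers who want to try the checkpoint, a minimal sketch of loading it with the `transformers` library follows. The model ID is the one named in this README. The helper name `summarize`, the 16384-token input limit (inferred from the model name), and the generation settings (`max_new_tokens`, `num_beams`, `no_repeat_ngram_size`) are assumptions chosen for illustration, not settings documented by the author.

```python
MODEL_ID = "pszemraj/long-t5-tglobal-base-16384-booksum-V11-big_patent-V2"


def summarize(text: str, max_input_tokens: int = 16384, max_new_tokens: int = 128) -> str:
    """Summarize `text` with the checkpoint named in this README.

    `summarize` is a hypothetical helper; the default limits are assumptions.
    """
    # Imported lazily so that defining the helper does not require transformers.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
    model = AutoModelForSeq2SeqLM.from_pretrained(MODEL_ID)

    # Truncate to the input length suggested by the model name (assumption).
    inputs = tokenizer(
        text,
        return_tensors="pt",
        truncation=True,
        max_length=max_input_tokens,
    )

    # Beam search with n-gram blocking; illustrative values, not tuned.
    output_ids = model.generate(
        **inputs,
        max_new_tokens=max_new_tokens,
        num_beams=4,
        no_repeat_ngram_size=3,
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `summarize(patent_text)` downloads the weights on first use and requires `transformers` and PyTorch to be installed. Long inputs on CPU can be slow, so a shorter `max_input_tokens` may be a practical choice for a quick check.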
Evaluation results
- ROUGE-1 on kmfoda/booksum (test set, verified): 23.144
- ROUGE-2 on kmfoda/booksum (test set, verified): 3.239
- ROUGE-L on kmfoda/booksum (test set, verified): 12.704
- ROUGE-LSUM on kmfoda/booksum (test set, verified): 19.810
- loss on kmfoda/booksum (test set, verified): 2.766
- gen_len on kmfoda/booksum (test set, verified): 63.449
- ROUGE-1 on samsum (test set, verified): 26.803
- ROUGE-2 on samsum (test set, verified): 6.066
- ROUGE-L on samsum (test set, verified): 20.010
- ROUGE-LSUM on samsum (test set, verified): 21.912
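
To make the ROUGE-1 and ROUGE-2 figures above easier to interpret, here is a simplified sketch of an n-gram overlap F1 score. It is an illustration only: it lowercases and splits on whitespace, applies no stemming, and is not the scorer that produced the verified results. The function name `rouge_n_f1` is hypothetical. The reported numbers appear to be on a 0 to 100 scale, whereas this sketch returns a value between 0 and 1.

```python
from collections import Counter


def rouge_n_f1(reference: str, candidate: str, n: int = 1) -> float:
    """Simplified ROUGE-N F1: clipped n-gram overlap between two texts."""

    def ngrams(text: str) -> Counter:
        tokens = text.lower().split()
        return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))

    ref, cand = ngrams(reference), ngrams(candidate)
    # Counter intersection keeps the minimum count per n-gram (clipped matches).
    overlap = sum((ref & cand).values())
    if overlap == 0:
        return 0.0

    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)
```

For example, `rouge_n_f1("the cat sat on the mat", "the cat lay on the mat")` shares five of six unigrams in each direction, so precision, recall, and F1 are all 5/6.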