Update README.md
README.md CHANGED
@@ -64,15 +64,20 @@ inference:
 ---

-# Longformer Encoder-Decoder (LED)
+# Longformer Encoder-Decoder (LED) for Narrative-Esque Long Text Summarization

+- **What:** This is the (current) result of the quest for a summarization model that condenses technical/long information down well _in general_, for academic and narrative use.
 - **Use cases:** long narrative summarization (think stories - as the dataset intended), article/paper/textbook/other summarization, technical-to-simple summarization.
   - Models trained on this dataset tend to also _explain_ what they are summarizing, which IMO is awesome.
+
+- works well on lots of text, and can handle 16384 tokens/batch.
+
+## About
+
 - Trained for 16 epochs vs. [`pszemraj/led-base-16384-finetuned-booksum`](https://huggingface.co/pszemraj/led-base-16384-finetuned-booksum),
   - parameters adjusted for _very_ fine-tuning-type training (super low LR, etc.)
   - all the parameters for generation on the API are the same for easy comparison between versions.
-- works well on lots of text, and can handle 16384 tokens/batch.

 ## Other Checkpoints on Booksum
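The updated card notes that the model handles long inputs (up to 16384 tokens per batch) and that generation parameters on the hosted API are kept identical across checkpoint versions for comparison. As a rough illustration of that usage (not part of the README change itself), here is a minimal `transformers` sketch; the checkpoint id shown is the comparison model linked above rather than this repo's own checkpoint, and the generation parameters are placeholder values, not the card's actual API settings:

```python
from transformers import pipeline

# Minimal usage sketch. The checkpoint below is the comparison model linked in the
# README; substitute the repo's own checkpoint id as appropriate.
summarizer = pipeline(
    "summarization",
    model="pszemraj/led-base-16384-finetuned-booksum",
)

# Placeholder input; LED accepts long documents (up to 16384 tokens per batch per the card).
long_text = "..."

# Generation parameters here are illustrative, not the card's actual API settings.
result = summarizer(
    long_text,
    min_length=16,
    max_length=256,
    num_beams=4,
    no_repeat_ngram_size=3,
    repetition_penalty=2.5,
    early_stopping=True,
)
print(result[0]["summary_text"])
```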