Harsh Trivedi
commited on
Commit
·
6377f16
1
Parent(s):
4ebe5ea
update.
Browse files
README.md
CHANGED
@@ -13,9 +13,9 @@ This paper proposes a procedure to synthetically generate a QA dataset, TeaBReaC
|
|
13 |
|
14 |
We release the following models:
|
15 |
|
16 |
-
- **A:** Base Models finetuned on target datasets: `{
|
17 |
- **B:** Base models pretrained on TeaBReaC: `teabreac-{base_model}`
|
18 |
-
- **C:** Base models pretrained on TeaBReaC and then finetuned on target datasets: `teabreac-{
|
19 |
|
20 |
The `base_model` above can be from: `bart-large`, `t5-large`, `t5-3b`, `nt5-small`, `preasm-large`.
|
21 |
The `target_dataset` above can be from: `drop`, `tatqa`, `iirc-gold`, `iirc-retrieved`, `numglue`.
|
@@ -27,7 +27,6 @@ The **A** models are only released for completeness / reproducibility. In your e
|
|
27 |
Please checkout the details in our [github repository](https://github.com/stonybrooknlp/teabreac), but in a nutshell:
|
28 |
|
29 |
```python
|
30 |
-
UPDATE_DISCLAIMER
|
31 |
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
|
32 |
from digit_tokenization import enable_digit_tokenization # digit_tokenization.py from https://github.com/stonybrooknlp/teabreac
|
33 |
|
|
|
13 |
|
14 |
We release the following models:
|
15 |
|
16 |
+
- **A:** Base Models finetuned on target datasets: `{base_model}-{target_dataset}`
|
17 |
- **B:** Base models pretrained on TeaBReaC: `teabreac-{base_model}`
|
18 |
+
- **C:** Base models pretrained on TeaBReaC and then finetuned on target datasets: `teabreac-{base_model}-{target_dataset}`
|
19 |
|
20 |
The `base_model` above can be from: `bart-large`, `t5-large`, `t5-3b`, `nt5-small`, `preasm-large`.
|
21 |
The `target_dataset` above can be from: `drop`, `tatqa`, `iirc-gold`, `iirc-retrieved`, `numglue`.
|
|
|
27 |
Please checkout the details in our [github repository](https://github.com/stonybrooknlp/teabreac), but in a nutshell:
|
28 |
|
29 |
```python
|
|
|
30 |
from transformers import AutoTokenizer, AutoModelForSeq2SeqLM
|
31 |
from digit_tokenization import enable_digit_tokenization # digit_tokenization.py from https://github.com/stonybrooknlp/teabreac
|
32 |
|