Multi-lingual Question Generating Model (mt5-base)

Give the model a passage and it will generate a question about the passage.
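
A minimal usage sketch with the Hugging Face `transformers` library follows. The repository ID `nbroad/mt5-base-qgen`, the helper name `generate_question`, and the generation settings (beam count, length limits) are assumptions for illustration, not values confirmed by this README. If the repository only ships Flax weights, passing `from_flax=True` may be needed to load them into PyTorch.

```python
# Hypothetical helper: the repo ID and generation settings are assumptions.
MODEL_ID = "nbroad/mt5-base-qgen"


def generate_question(passage, model_id=MODEL_ID, max_new_tokens=64, from_flax=False):
    """Return one generated question for the given passage."""
    # Imported lazily so the module can be loaded without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id, from_flax=from_flax)

    # The passage is passed as-is; truncate long inputs to 512 tokens (assumed limit).
    inputs = tokenizer(passage, return_tensors="pt", truncation=True, max_length=512)

    # Beam search with 4 beams is an arbitrary choice, not a documented setting.
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, num_beams=4)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)


# Example call (downloads the model on first use):
# print(generate_question("The Eiffel Tower was completed in 1889 in Paris."))
```

Loading the tokenizer and model inside the function keeps the sketch short; for repeated calls, load them once and reuse them.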

Trained on the following datasets:

  • SQuAD (English)
  • TyDiQA-GoldP (Arabic, Bengali, Finnish, Japanese, Indonesian, Kiswahili, Korean, Russian, Telugu, Thai)
  • MLQA (Arabic, Chinese, English, German, Hindi, Spanish, Vietnamese)
  • XQuAD (Arabic, Chinese, German, Greek, Hindi, Russian, Spanish, Thai, Turkish, Vietnamese)
  • GermanQuAD (German)
  • Persian QA (Persian)
  • Bengali QA (Bengali)
  • Chaii QA (Hindi, Tamil)

There is no guarantee that it will produce a question in the language of the passage, but it usually does.

The model was trained with the Flax summarization script on Cloud TPUs provided by Google's TPU Research Cloud (TRC).