Multi-lingual Question Generating Model (mt5-base)
Give the model a passage and it will generate a question about the passage.
Trained on the following datasets:
- SQuAD (English)
- TyDiQA-GoldP (Arabic, Bengali, Finnish, Japanese, Indonesian, Kiswahili, Korean, Russian, Telugu, Thai)
- MLQA (Arabic, Chinese, English, German, Hindi, Spanish, Vietnames)
- XQuAD (Arabic, Chinese, German, Greek, Hindi, Russian, Spanish, Thai, Turkish Vietnamese)
- GermanQuAD (German)
- Persian QA (Persian)
- Bengali QA (Bengali)
- chaii (Hindi, Tamil)
Training details
I used flax summarization script and a TPU v3-8. Summarization expects a text column and a summary column. For question generation training, use the context column instead of text column and question instead of summary column.
There is no guarantee that it will produce a question in the language of the passage, but it usually does.
Model trained on Cloud TPUs from Google's TPU Research Cloud (TRC)