torch
tokenizers
transformers
datasets
gradio>=3.0.0
soundfile
sentencepiece
librosa