Transcribe audio files into text
Transcribe audio to text with speaker diarization
Convert spoken words into text