Transcribe and translate audio into text
Transcribe audio from microphone, file, or YouTube link
Generate text based on input sentences