Upload audio to get a text response and voice reply
Generate answers and speech from voice or text input