Generate speech from text
Communicate with a multimodal chatbot
Generate responses using text and images