Generate descriptions for images using text prompts
Generate high-fidelity audio from input audio waveforms
Generate insights from charts using text prompts