Stable audio open model from Synthio paper.
Analyze audio and answer questions about it
Answer questions about audio