OmniParser, turn your LLM into GUI agent
Generate high-quality audio from text using various controls
Upload a video or image to get conversational explanations