The best Arabic-English VLM developed by MBZUAI.
Co-Speech Gesture Video Generation
Generate realistic audio from text
Generate images from text descriptions
Create images from various types of annotations