Real-time video generation
Generate high-quality videos from text prompts and images
Audio-Driven Multi-Person Conversational Video Generation
Generate audio for a video using captions and descriptions
Control 3D models using hand gestures and voice commands
Audio Conditioned LipSync with Latent Diffusion Models