Co-Speech 3D Gesture Generation
Generate text from audio recordings
F5-TTS & E2-TTS: Zero-Shot Voice Cloning (Unofficial Demo)
Co-Speech Gesture Video Generation