Generate lip-synced video for audio and reference video
Create real-time lip-synchronized videos from audio