Would you release the flow matching & HiFi-GAN model weights?

by JosephusCheung - opened 19 days ago

For role-aware speech synthesis, we utilize the pre-trained conditional flow matching and HiFi-GAN weights from GLM-4-Voice (Zeng et al., 2024) and fine-tune them on high-quality role speech data.

haonanzhang

Tongyi-ConvAI org 19 days ago

Sorry, due to company policy, we are unable to release the weights of the flow matching and HiFi-GAN models. We’ll upload the character audio data later, and you can use it to fine-tune the pretrained GLM-4-Voice weights yourself.

haonanzhang changed discussion status to closed 6 days ago
