FreeSVC: Zero-shot Multilingual Singing Voice Conversion

FreeSVC is a multilingual zero-shot singing voice conversion model. It converts singing voices across languages without extensive language-specific training. The code is available in the project's GitHub repository, and the paper is available as an arXiv pre-print (https://arxiv.org/abs/2501.05586).

Supported Languages

| Language   | ID | Status     | Speech Data | Singing Data |
|------------|----|------------|-------------|--------------|
| Chinese    | 0  | ✅ Full    | 255h        | 70h          |
| Dutch      | 1  | ✅ Full    | Part of CML | -            |
| English    | 2  | ✅ Full    | 921h        | 47h          |
| French     | 3  | ✅ Full    | Part of CML | -            |
| German     | 4  | ✅ Full    | Part of CML | -            |
| Italian    | 5  | ✅ Full    | Part of CML | -            |
| Japanese   | 6  | ✅ Full    | 30h         | -            |
| Other*     | 7  | ⚠️ Partial | -           | 10h          |
| Polish     | 8  | ✅ Full    | Part of CML | -            |
| Portuguese | 9  | ✅ Full    | Part of CML | -            |
| Spanish    | 10 | ✅ Full    | Part of CML | -            |

*Note: The "Other" category is used for recordings of vocal techniques that carry no linguistic content.
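
When calling the model from a script, it can help to look up a language ID by name rather than hard-coding the integer. The following is a minimal sketch that copies the IDs from the table above. The names `LANGUAGE_IDS` and `language_id` are hypothetical helpers for illustration and are not taken from the FreeSVC repository, which may expose this mapping differently.

```python
# Language-name -> ID mapping copied from the "Supported Languages" table.
# LANGUAGE_IDS and language_id are illustrative names, not FreeSVC APIs.
LANGUAGE_IDS = {
    "Chinese": 0,
    "Dutch": 1,
    "English": 2,
    "French": 3,
    "German": 4,
    "Italian": 5,
    "Japanese": 6,
    "Other": 7,  # vocal techniques; only partially supported
    "Polish": 8,
    "Portuguese": 9,
    "Spanish": 10,
}


def language_id(name: str) -> int:
    """Return the ID for a language name, ignoring case and outer whitespace."""
    key = name.strip().title()
    if key not in LANGUAGE_IDS:
        supported = ", ".join(sorted(LANGUAGE_IDS))
        raise ValueError(f"Unsupported language {name!r}; expected one of: {supported}")
    return LANGUAGE_IDS[key]
```

For example, `language_id("english")` resolves to the ID listed for English in the table, and an unlisted language such as `"Korean"` raises a `ValueError`.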

Model Overview

FreeSVC builds on an enhanced VITS architecture integrated with Speaker-invariant Clustering (SPIN) and the ECAPA2 speaker encoder. This combination is designed to separate speaker characteristics from linguistic content, which supports natural-sounding voice conversion across multiple languages.

Training Datasets

FreeSVC was trained on a diverse set of speech and singing datasets covering multiple languages:

| Dataset    | Hours    | Language         | Type            |
|------------|----------|------------------|-----------------|
| AISHELL-1  | 170h     | Chinese          | Speech          |
| AISHELL-3  | 85h      | Chinese          | Speech          |
| CML-TTS    | 3.1k h   | 7 Languages      | Speech          |
| HiFiTTS    | 292h     | English          | Speech          |
| JVS        | 30h      | Japanese         | Speech          |
| LibriTTS-R | 585h     | English          | Speech          |
| NUS (NHSS) | 7h       | English          | Speech, Singing |
| OpenSinger | 50h      | Chinese          | Singing         |
| Opencpop   | 5h       | Chinese          | Singing         |
| PopBuTFy   | 10h, 40h | Chinese, English | Singing         |
| POPCS      | 5h       | Chinese          | Singing         |
| VCTK       | 44h      | English          | Speech          |
| VocalSet   | 10h      | Other            | Singing         |
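
The per-language totals in the Supported Languages table can be reproduced by summing the dataset hours listed here. The sketch below does that arithmetic under two assumptions that the tables suggest but do not state: PopBuTFy's "10h, 40h" is read as 10h Chinese and 40h English, and the 7h of NUS (NHSS) is counted as singing data. CML-TTS is left out because its hours are not broken down by language here.

```python
from collections import defaultdict

# (dataset, hours, language, type) rows copied from the training-dataset table.
# CML-TTS is omitted: the table gives no per-language breakdown for it.
DATASETS = [
    ("AISHELL-1", 170, "Chinese", "Speech"),
    ("AISHELL-3", 85, "Chinese", "Speech"),
    ("HiFiTTS", 292, "English", "Speech"),
    ("JVS", 30, "Japanese", "Speech"),
    ("LibriTTS-R", 585, "English", "Speech"),
    ("NUS (NHSS)", 7, "English", "Singing"),  # assumption: counted as singing
    ("OpenSinger", 50, "Chinese", "Singing"),
    ("Opencpop", 5, "Chinese", "Singing"),
    ("PopBuTFy", 10, "Chinese", "Singing"),   # assumption: 10h is the Chinese part
    ("PopBuTFy", 40, "English", "Singing"),   # assumption: 40h is the English part
    ("POPCS", 5, "Chinese", "Singing"),
    ("VCTK", 44, "English", "Speech"),
    ("VocalSet", 10, "Other", "Singing"),
]


def hours_by_language_and_type(rows):
    """Sum hours for each (language, type) pair."""
    totals = defaultdict(int)
    for _name, hours, language, kind in rows:
        totals[(language, kind)] += hours
    return dict(totals)


totals = hours_by_language_and_type(DATASETS)
for (language, kind), hours in sorted(totals.items()):
    print(f"{language:<9}{kind:<8}{hours}h")
```

Under these assumptions the sums match the Supported Languages table: 255h speech and 70h singing for Chinese, 921h speech and 47h singing for English, 30h speech for Japanese, and 10h singing for Other.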

Citation

@misc{ferreira2025freesvczeroshotmultilingualsinging,
      title={FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion}, 
      author={Alef Iury Siqueira Ferreira and Lucas Rafael Gris and Augusto Seben da Rosa and Frederico Santos de Oliveira and Edresson Casanova and Rafael Teixeira Sousa and Arnaldo Candido Junior and Anderson da Silva Soares and Arlindo Galvão Filho},
      year={2025},
      eprint={2501.05586},
      archivePrefix={arXiv},
      primaryClass={cs.SD},
      url={https://arxiv.org/abs/2501.05586}, 
}