Kotoba-Speech-v0.1

Kotoba-Speech v0.1 is a 1.2B-parameter Transformer-based speech generation model. It supports the following features:

  1. Fluent text-to-speech generation in Japanese
  2. One-shot voice cloning from a speech prompt

Usage

Please check out our HF Spaces demo.

Model Details

  • Model type: End-to-end Transformer
  • Language(s): Japanese
  • Library: We'll release our training code soon. The inference and model code are largely adapted from metavoice.

Acknowledgements

  • We thank meta-voice for open-sourcing their code.

License

Apache License Version 2.0, January 2004
