kotoba-tech/kotoba-speech-v0.1

Kotoba-Speech-v0.1

Kotoba-Speech v0.1 is a 1.2B-parameter Transformer-based speech generative model. It supports the following:

Fluent text-to-speech generation in Japanese
One-shot voice cloning through a speech prompt

Usage

Please check out our HF Spaces demo.
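
The card does not document a programmatic interface, so the following is only a minimal sketch of how the model files might be fetched for local experimentation. It assumes the `huggingface_hub` package is installed. The helper name `fetch_kotoba_speech` is hypothetical and not part of any released code. It downloads the repository contents only and does not run inference.

```python
from pathlib import Path
from typing import Optional

# Repository id as shown on the model page.
REPO_ID = "kotoba-tech/kotoba-speech-v0.1"


def fetch_kotoba_speech(local_dir: Optional[str] = None) -> Path:
    """Download the files in the model repository and return the local folder.

    Hypothetical helper for illustration; it only fetches files.
    """
    # Imported lazily so this module can be imported without the dependency.
    from huggingface_hub import snapshot_download

    # snapshot_download returns the path of the downloaded snapshot folder.
    path = snapshot_download(repo_id=REPO_ID, local_dir=local_dir)
    return Path(path)
```

Calling `fetch_kotoba_speech()` with no arguments stores the files in the default Hugging Face cache. Passing `local_dir` places them in a folder of your choice.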

Model Details

Model type: End-to-end Transformer model.
Language(s): Japanese
Library: We'll release our training code soon. Inference and model code are largely adapted from metavoice.

Acknowledgements

We thank meta-voice for open-sourcing their code.

License

Apache License, Version 2.0, January 2004
