license: apache-2.0 | |
library_name: kittentts | |
# Kitten TTS Nano 0.2 π» | |
Kitten TTS is an open-source realistic text-to-speech model with just 15 million parameters, designed for lightweight deployment and high-quality voice synthesis. | |
*Currently in developer preview* | |
## β¨ Features | |
- **Ultra-lightweight**: Model size less than 25MB | |
- **CPU-optimized**: Runs without GPU on any device | |
- **High-quality voices**: Several premium voice options available | |
- **Fast inference**: Optimized for real-time speech synthesis | |
## π Quick Start | |
### Installation | |
``` | |
pip install https://github.com/KittenML/KittenTTS/releases/download/0.1/kittentts-0.1.0-py3-none-any.whl | |
``` | |
### Basic Usage | |
``` | |
from kittentts import KittenTTS | |
m = KittenTTS("KittenML/kitten-tts-nano-0.2") | |
audio = m.generate("This high quality TTS model works without a GPU", voice='expr-voice-2-f' ) | |
# available_voices : [ 'expr-voice-2-m', 'expr-voice-2-f', 'expr-voice-3-m', 'expr-voice-3-f', 'expr-voice-4-m', 'expr-voice-4-f', 'expr-voice-5-m', 'expr-voice-5-f' ] | |
# Save the audio | |
import soundfile as sf | |
sf.write('output.wav', audio, 24000) | |
``` | |
## π» System Requirements | |
Works literally everywhere | |
## Checklist | |
- [x] Release a preview model | |
- [ ] Release the fully trained model weights | |
- [ ] Release mobile SDK | |
- [ ] Release web version | |