Text-to-Speech
Finnish
File size: 1,215 Bytes
36e015a
 
 
 
 
122c149
36e015a
 
 
 
f817a3b
2ab0b09
 
4c7081e
e0e22e1
8a137ed
 
8b9bf83
e0e22e1
 
 
 
 
8486f2f
e0e22e1
78761f0
 
 
104e79a
78761f0
 
8b9bf83
e0e22e1
03d2d74
e0e22e1
 
fafc1c5
8486f2f
77d14db
78761f0
 
 
104e79a
78761f0
 
c3af2ad
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
---
license: cc-by-nc-4.0
datasets:
- mozilla-foundation/common_voice_17_0
- facebook/voxpopuli
- mrfakename/librivox-full-catalog-archive
language:
- fi
base_model:
- SWivid/F5-TTS
pipeline_tag: text-to-speech
---

Here are two Finnish models of the F5-TTS, listen speech samples for models.

Numbers cannot be understood by models. Convert numbers to words.

--- --- ---

The Common Voice and Vox Populi Finnish datasets are used for the first round.

- 20241206

- Speakers: Several speakers from different corpus

- Use these with "f5-tts_infer-gradio":

Model: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_common_voice_fi_vox_populi_fi_20241206.safetensors

Vocab: hf://AsmoKoskinen/F5-TTS_Finnish_Model/vocab.txt

--- --- ---

The second round is based on the Common Voice, LibriVox and Vox Populi Finnish data sets. Use this as a default one.

- 20241217

- Speakers: Several speakers from different corpus

- Use these with "f5-tts_infer-gradio":

Model: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20241217/model_last_20241217.safetensors

Vocab: hf://AsmoKoskinen/F5-TTS_Finnish_Model/model_commonvoice_fi_librivox_fi_vox_populi_fi_20241217/vocab.txt

--- --- ---