Malay
File size: 928 Bytes
84d27e0
 
 
 
 
fe379af
 
 
 
fd6542b
63ccb21
db98433
 
fd6542b
 
db98433
63ccb21
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
---
datasets:
- mesolitica/TTS
language:
- ms
---

# StyleTTS2 MS

Forked at https://github.com/malaysia-ai/StyleTTS2-MS, only trained on first stage.

## Pre-trained modules

1. Forked original [yl4579/AuxiliaryASR](https://github.com/yl4579/AuxiliaryASR) at [malaysia-ai/AuxiliaryASR-Phonemizer](https://github.com/malaysia-ai/AuxiliaryASR-Phonemizer) to use `ms` phonemizer and trained on [mesolitica/tts-combine-annotated](https://huggingface.co/datasets/mesolitica/tts-combine-annotated) dataset.
2. Forked original [PL-BERT](https://arxiv.org/abs/2301.08810) at [malaysia-ai/PL-BERT-MS](https://github.com/malaysia-ai/PL-BERT-MS) to use custom word tokenizer and pretrained on Malay Wikipedia and local news.

## Checkpoints

We uploaded full checkpoints with optimizer states at [checkpoints-first-stage](checkpoints-first-stage).

## Dataset

We train on [Mesolitica/TTS](https://huggingface.co/datasets/mesolitica/TTS).