dathudeptrai commited on
Commit
f9db22e
·
1 Parent(s): d28ab58

🦋 Update README

Browse files
Files changed (1) hide show
  1. README.md +79 -0
README.md ADDED
@@ -0,0 +1,79 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - tensorflowtts
4
+ - audio
5
+ - text-to-speech
6
+ - text-to-mel
7
+ language: eng
8
+ license: apache-2.0
9
+ datasets:
10
+ - LJSpeech
11
+ widget:
12
+ - text: "How are you?"
13
+ ---
14
+
15
+ # FastSpeech trained on LJSpeech (Eng))
16
+ This repository provides a pretrained [FastSpeech](https://arxiv.org/abs/1905.09263) trained on LJSpeech dataset (ENG). For a detail of the model, we encourage you to read more about
17
+ [TensorFlowTTS](https://github.com/TensorSpeech/TensorFlowTTS).
18
+
19
+
20
+ ## Install TensorFlowTTS
21
+ First of all, please install TensorFlowTTS with the following command:
22
+ ```
23
+ pip install TensorFlowTTS
24
+ ```
25
+
26
+ ### Converting your Text to Mel Spectrogram
27
+ ```python
28
+ from tensorflow_tts.inference import AutoProcessor
29
+ from tensorflow_tts.inference import TFAutoModel
30
+
31
+ processor = AutoProcessor.from_pretrained("tensorspeech/tts-fastspeech-ljspeech-en")
32
+ fastspeech = TFAutoModel.from_pretrained("tensorspeech/tts-fastspeech-ljspeech-en")
33
+
34
+ text = "How are you?"
35
+
36
+ input_ids = processor.text_to_sequence(text)
37
+
38
+ mel_before, mel_after, duration_outputs = fastspeech.inference(
39
+ input_ids=tf.expand_dims(tf.convert_to_tensor(input_ids, dtype=tf.int32), 0),
40
+ speaker_ids=tf.convert_to_tensor([0], dtype=tf.int32),
41
+ speed_ratios=tf.convert_to_tensor([1.0], dtype=tf.float32),
42
+ )
43
+ ```
44
+
45
+ #### Referencing FastSpeech
46
+ ```
47
+ @article{DBLP:journals/corr/abs-1905-09263,
48
+ author = {Yi Ren and
49
+ Yangjun Ruan and
50
+ Xu Tan and
51
+ Tao Qin and
52
+ Sheng Zhao and
53
+ Zhou Zhao and
54
+ Tie{-}Yan Liu},
55
+ title = {FastSpeech: Fast, Robust and Controllable Text to Speech},
56
+ journal = {CoRR},
57
+ volume = {abs/1905.09263},
58
+ year = {2019},
59
+ url = {http://arxiv.org/abs/1905.09263},
60
+ archivePrefix = {arXiv},
61
+ eprint = {1905.09263},
62
+ timestamp = {Wed, 11 Nov 2020 08:48:07 +0100},
63
+ biburl = {https://dblp.org/rec/journals/corr/abs-1905-09263.bib},
64
+ bibsource = {dblp computer science bibliography, https://dblp.org}
65
+ }
66
+ ```
67
+
68
+ #### Referencing TensorFlowTTS
69
+ ```
70
+ @misc{TFTTS,
71
+ author = {Minh Nguyen, Alejandro Miguel Velasquez, Erogol, Kuan Chen, Dawid Kobus, Takuya Ebata,
72
+ Trinh Le and Yunchao He},
73
+ title = {TensorflowTTS},
74
+ year = {2020},
75
+ publisher = {GitHub},
76
+ journal = {GitHub repository},
77
+ howpublished = {\\url{https://github.com/TensorSpeech/TensorFlowTTS}},
78
+ }
79
+ ```