sanchit-gandhi commited on
Commit
2f42453
·
1 Parent(s): 6601d2d

shorten intro

Browse files
Files changed (1) hide show
  1. app.py +2 -4
app.py CHANGED
@@ -132,16 +132,14 @@ with gr.Blocks(css=css) as block:
132
  )
133
  gr.HTML(
134
  f"""
135
- <p><a href="https://github.com/huggingface/parler-tts"> Parler-TTS</a> is a training and inference library for
136
- high-fidelity text-to-speech (TTS) models. The model demonstrated here, <a href="https://huggingface.co/parler-tts/parler_tts_mini_expresso_v0.1"> Parler-TTS Mini: Expresso v0.1</a>,
137
- is fine-tuned on the <a href="https://huggingface.co/datasets/ylacombe/expresso"> Expresso dataset</a>.
138
  It generates high-quality speech in a given <b>emotion</b> and <b>voice</b> that can be controlled through a simple text prompt.</p>
139
 
140
  <p>Tips for ensuring good generation:
141
  <ul>
142
  <li>Specify the name of a male speaker (Jerry, Thomas) or female speaker (Talia, Elisabeth) for consistent voices</li>
143
  <li>The model can generate in a range of emotions, including: "happy", "confused", "default" (meaning no particular emotion conveyed), "laughing", "sad", "whisper", "emphasis"</li>
144
- <li>Include the term "high quality audio" to generate the highest quality audio, and "very noisy audio" for high levels of background noise</li>
145
  <li>Punctuation can be used to control the prosody of the generations, e.g. use commas to add small breaks in speech</li>
146
  <li>Wrap words in asterisk to emphasise them (e.g. `*Remember*` in the example below)</li>
147
  </ul>
 
132
  )
133
  gr.HTML(
134
  f"""
135
+ <p><a href="https://huggingface.co/parler-tts/parler_tts_mini_expresso_v0.1"> Parler-TTS Mini: Expresso v0.1</a>
136
+ is a text-to-speech (TTS) model fine-tuned on the <a href="https://huggingface.co/datasets/ylacombe/expresso"> Expresso dataset</a>.
 
137
  It generates high-quality speech in a given <b>emotion</b> and <b>voice</b> that can be controlled through a simple text prompt.</p>
138
 
139
  <p>Tips for ensuring good generation:
140
  <ul>
141
  <li>Specify the name of a male speaker (Jerry, Thomas) or female speaker (Talia, Elisabeth) for consistent voices</li>
142
  <li>The model can generate in a range of emotions, including: "happy", "confused", "default" (meaning no particular emotion conveyed), "laughing", "sad", "whisper", "emphasis"</li>
 
143
  <li>Punctuation can be used to control the prosody of the generations, e.g. use commas to add small breaks in speech</li>
144
  <li>Wrap words in asterisk to emphasise them (e.g. `*Remember*` in the example below)</li>
145
  </ul>