shuaijiang commited on
Commit
da5f0db
·
1 Parent(s): 795c3a9
index.html CHANGED
@@ -134,6 +134,7 @@
134
  This real-time speech interaction is particularly valuable in scenarios requiring rapid feedback and immediate responses, dramatically enhancing user experience.
135
  However, there is a notable lack of research focused on real-time large speech language models, particularly for Chinese.
136
  In this work, we present KE-Omni, a seamless large speech language model built upon Ke-SpeechChat, a large-scale high-quality synthesized speech interaction dataset consisting of 7 million Chinese and English conversations, featuring 42,002 speakers, and totaling over 60,000 hours, This contributes significantly to the advancement of research and development in this field.
 
137
  </p>
138
 
139
  </div>
@@ -157,7 +158,7 @@
157
  </div>
158
  </div>
159
 
160
- <h2 class="title is-2 has-text-centered" >KE-SpeechChat Dataset Construction</h2>
161
  <div class="columns is-centered">
162
  <div class="column">
163
  <div class="content dialogue-block" style="display: flex; flex-direction: column; justify-content: space-between; height: 100%;">
@@ -198,14 +199,14 @@
198
  <div class="column">
199
  <div class="content dialogue-block" style="display: flex; flex-direction: column; justify-content: space-between; height: 100%;">
200
  <h3 class="title is-4">User-male, Agent-male</h3>
201
- <p>User: 奥运会是不是每四年举办一次啊</p>
202
- <p>Agent: 是的,奥运会确实是每四年举办一次,不过有时候会因为特殊情况有所调整,比如之前就因为全球健康危机延期过。</p>
203
  <div style="margin-top: auto;">
204
  <audio controls>
205
- <source src="./static/audios/zh_26157_0_male.wav" type="audio/wav">
206
  </audio>
207
  <audio controls>
208
- <source src="./static/audios/zh_26157_1_male.wav" type="audio/wav">
209
  </audio>
210
  </div>
211
  </div>
@@ -246,14 +247,14 @@
246
  <div class="column">
247
  <div class="content" style="display: flex; flex-direction: column; justify-content: space-between; height: 100%;">
248
  <h3 class="title is-4">User-female, Agent-female</h3>
249
- <p>User: 如果我想提高写作水平,多读书真的有帮助吗?</p>
250
- <p>Agent: 当然有帮助,多读书可以让你接触到不同的写作风格和表达方式,还能增加词汇量,对提高写作水平特别有用。</p>
251
  <div style="margin-top: auto;">
252
  <audio controls>
253
- <source src="./static/audios/zh_2518949_0_female.wav" type="audio/wav">
254
  </audio>
255
  <audio controls>
256
- <source src="./static/audios/zh_2518949_1_female.wav" type="audio/wav">
257
  </audio>
258
  </div>
259
  </div>
 
134
  This real-time speech interaction is particularly valuable in scenarios requiring rapid feedback and immediate responses, dramatically enhancing user experience.
135
  However, there is a notable lack of research focused on real-time large speech language models, particularly for Chinese.
136
  In this work, we present KE-Omni, a seamless large speech language model built upon Ke-SpeechChat, a large-scale high-quality synthesized speech interaction dataset consisting of 7 million Chinese and English conversations, featuring 42,002 speakers, and totaling over 60,000 hours, This contributes significantly to the advancement of research and development in this field.
137
+ Furthermore, we plan to release the code and models after a thorough risk assessment.
138
  </p>
139
 
140
  </div>
 
158
  </div>
159
  </div>
160
 
161
+ <h2 class="title is-2 has-text-centered" >Ke-SpeechChat Dataset Construction</h2>
162
  <div class="columns is-centered">
163
  <div class="column">
164
  <div class="content dialogue-block" style="display: flex; flex-direction: column; justify-content: space-between; height: 100%;">
 
199
  <div class="column">
200
  <div class="content dialogue-block" style="display: flex; flex-direction: column; justify-content: space-between; height: 100%;">
201
  <h3 class="title is-4">User-male, Agent-male</h3>
202
+ <p>User: 北京一年的平均气温是多少度啊?</p>
203
+ <p>Agent: 北京一年的平均气温大概在十二度左右,不过四季温差挺大的,冬天会冷很多,夏天又热一些。</p>
204
  <div style="margin-top: auto;">
205
  <audio controls>
206
+ <source src="./static/audios/belle1M_openQA_belle1M_openQA_1678197_0_mark.wav" type="audio/wav">
207
  </audio>
208
  <audio controls>
209
+ <source src="./static/audios/belle1M_openQA_belle1M_openQA_1678197_1_mark.wav" type="audio/wav">
210
  </audio>
211
  </div>
212
  </div>
 
247
  <div class="column">
248
  <div class="content" style="display: flex; flex-direction: column; justify-content: space-between; height: 100%;">
249
  <h3 class="title is-4">User-female, Agent-female</h3>
250
+ <p>User: 这句话:读万卷书,行万里路是什么意思呢?</p>
251
+ <p>Agent: 这句话的意思是说,读书和实践都很重要。读万卷书指的是要多读书,增长知识;行万里路则是指要多出去走走,增长见识。结合起来就是理论和实践都要兼顾,这样才能更好地成长。</p>
252
  <div style="margin-top: auto;">
253
  <audio controls>
254
+ <source src="./static/audios/bella_openQA_bella_openQA_916716_0_mark.wav" type="audio/wav">
255
  </audio>
256
  <audio controls>
257
+ <source src="./static/audios/bella_openQA_bella_openQA_916716_1_mark.wav" type="audio/wav">
258
  </audio>
259
  </div>
260
  </div>
static/audios/bella_openQA_bella_openQA_916716_0_mark.wav ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2f640dfdf952f82a4a067135c6dba4746cc54eca48a70b9edf036315cae62ce7
3
+ size 458634
static/audios/bella_openQA_bella_openQA_916716_1_mark.wav ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b64f182e76128686ab68af687382949d568abed80527adcbc39927b63045da1c
3
+ size 1690522
static/audios/belle1M_openQA_belle1M_openQA_1678197_0_mark.wav ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5556d108b07be4875ddd3c218432193bad50b2c535c2493c34917475e35ca3ed
3
+ size 377738
static/audios/belle1M_openQA_belle1M_openQA_1678197_1_mark.wav ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:788efaa7bf543bddf0c8fde4083ccc4a5dc7c682bbb93c486fd1f68daeb6f5f6
3
+ size 728970