Commit
·
da5f0db
1
Parent(s):
795c3a9
update
Browse files
index.html
CHANGED
@@ -134,6 +134,7 @@
|
|
134 |
This real-time speech interaction is particularly valuable in scenarios requiring rapid feedback and immediate responses, dramatically enhancing user experience.
|
135 |
However, there is a notable lack of research focused on real-time large speech language models, particularly for Chinese.
|
136 |
In this work, we present KE-Omni, a seamless large speech language model built upon Ke-SpeechChat, a large-scale high-quality synthesized speech interaction dataset consisting of 7 million Chinese and English conversations, featuring 42,002 speakers, and totaling over 60,000 hours, This contributes significantly to the advancement of research and development in this field.
|
|
|
137 |
</p>
|
138 |
|
139 |
</div>
|
@@ -157,7 +158,7 @@
|
|
157 |
</div>
|
158 |
</div>
|
159 |
|
160 |
-
<h2 class="title is-2 has-text-centered" >
|
161 |
<div class="columns is-centered">
|
162 |
<div class="column">
|
163 |
<div class="content dialogue-block" style="display: flex; flex-direction: column; justify-content: space-between; height: 100%;">
|
@@ -198,14 +199,14 @@
|
|
198 |
<div class="column">
|
199 |
<div class="content dialogue-block" style="display: flex; flex-direction: column; justify-content: space-between; height: 100%;">
|
200 |
<h3 class="title is-4">User-male, Agent-male</h3>
|
201 |
-
<p>User:
|
202 |
-
<p>Agent:
|
203 |
<div style="margin-top: auto;">
|
204 |
<audio controls>
|
205 |
-
<source src="./static/audios/
|
206 |
</audio>
|
207 |
<audio controls>
|
208 |
-
<source src="./static/audios/
|
209 |
</audio>
|
210 |
</div>
|
211 |
</div>
|
@@ -246,14 +247,14 @@
|
|
246 |
<div class="column">
|
247 |
<div class="content" style="display: flex; flex-direction: column; justify-content: space-between; height: 100%;">
|
248 |
<h3 class="title is-4">User-female, Agent-female</h3>
|
249 |
-
<p>User:
|
250 |
-
<p>Agent:
|
251 |
<div style="margin-top: auto;">
|
252 |
<audio controls>
|
253 |
-
<source src="./static/audios/
|
254 |
</audio>
|
255 |
<audio controls>
|
256 |
-
<source src="./static/audios/
|
257 |
</audio>
|
258 |
</div>
|
259 |
</div>
|
|
|
134 |
This real-time speech interaction is particularly valuable in scenarios requiring rapid feedback and immediate responses, dramatically enhancing user experience.
|
135 |
However, there is a notable lack of research focused on real-time large speech language models, particularly for Chinese.
|
136 |
In this work, we present KE-Omni, a seamless large speech language model built upon Ke-SpeechChat, a large-scale high-quality synthesized speech interaction dataset consisting of 7 million Chinese and English conversations, featuring 42,002 speakers, and totaling over 60,000 hours, This contributes significantly to the advancement of research and development in this field.
|
137 |
+
Furthermore, we plan to release the code and models after a thorough risk assessment.
|
138 |
</p>
|
139 |
|
140 |
</div>
|
|
|
158 |
</div>
|
159 |
</div>
|
160 |
|
161 |
+
<h2 class="title is-2 has-text-centered" >Ke-SpeechChat Dataset Construction</h2>
|
162 |
<div class="columns is-centered">
|
163 |
<div class="column">
|
164 |
<div class="content dialogue-block" style="display: flex; flex-direction: column; justify-content: space-between; height: 100%;">
|
|
|
199 |
<div class="column">
|
200 |
<div class="content dialogue-block" style="display: flex; flex-direction: column; justify-content: space-between; height: 100%;">
|
201 |
<h3 class="title is-4">User-male, Agent-male</h3>
|
202 |
+
<p>User: 北京一年的平均气温是多少度啊?</p>
|
203 |
+
<p>Agent: 北京一年的平均气温大概在十二度左右,不过四季温差挺大的,冬天会冷很多,夏天又热一些。</p>
|
204 |
<div style="margin-top: auto;">
|
205 |
<audio controls>
|
206 |
+
<source src="./static/audios/belle1M_openQA_belle1M_openQA_1678197_0_mark.wav" type="audio/wav">
|
207 |
</audio>
|
208 |
<audio controls>
|
209 |
+
<source src="./static/audios/belle1M_openQA_belle1M_openQA_1678197_1_mark.wav" type="audio/wav">
|
210 |
</audio>
|
211 |
</div>
|
212 |
</div>
|
|
|
247 |
<div class="column">
|
248 |
<div class="content" style="display: flex; flex-direction: column; justify-content: space-between; height: 100%;">
|
249 |
<h3 class="title is-4">User-female, Agent-female</h3>
|
250 |
+
<p>User: 这句话:读万卷书,行万里路是什么意思呢?</p>
|
251 |
+
<p>Agent: 这句话的意思是说,读书和实践都很重要。读万卷书指的是要多读书,增长知识;行万里路则是指要多出去走走,增长见识。结合起来就是理论和实践都要兼顾,这样才能更好地成长。</p>
|
252 |
<div style="margin-top: auto;">
|
253 |
<audio controls>
|
254 |
+
<source src="./static/audios/bella_openQA_bella_openQA_916716_0_mark.wav" type="audio/wav">
|
255 |
</audio>
|
256 |
<audio controls>
|
257 |
+
<source src="./static/audios/bella_openQA_bella_openQA_916716_1_mark.wav" type="audio/wav">
|
258 |
</audio>
|
259 |
</div>
|
260 |
</div>
|
static/audios/bella_openQA_bella_openQA_916716_0_mark.wav
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2f640dfdf952f82a4a067135c6dba4746cc54eca48a70b9edf036315cae62ce7
|
3 |
+
size 458634
|
static/audios/bella_openQA_bella_openQA_916716_1_mark.wav
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:b64f182e76128686ab68af687382949d568abed80527adcbc39927b63045da1c
|
3 |
+
size 1690522
|
static/audios/belle1M_openQA_belle1M_openQA_1678197_0_mark.wav
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5556d108b07be4875ddd3c218432193bad50b2c535c2493c34917475e35ca3ed
|
3 |
+
size 377738
|
static/audios/belle1M_openQA_belle1M_openQA_1678197_1_mark.wav
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:788efaa7bf543bddf0c8fde4083ccc4a5dc7c682bbb93c486fd1f68daeb6f5f6
|
3 |
+
size 728970
|