FunAudioLLM/SenseVoiceSmall


LZR9926 committed on Jul 3, 2024

Commit 53b8cde (verified) · 1 Parent(s): fd146c0

Update README.md

Files changed (1)

README.md +3 -0


@@ -41,8 +41,11 @@ SenseVoice-Small is an encoder-only speech foundation model designed for rapid v
 The SenseVoice-Small model is based on a non-autoregressive end-to-end framework. For a specified task, we prepend four embeddings as input to the encoder:
 - LID: For predicting the language ID of the audio.
 - SER: For predicting the emotion label of the audio.
 - AED: For predicting the event label of the audio.
 - ITN: For specifying whether inverse text normalization is applied to the recognized text.
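
The prepending step described above can be pictured with a minimal NumPy sketch. This is an illustration only, not the model's actual code: the embedding size, the language and ITN vocabularies, the random embedding tables, the slot order (taken from the list above), and the helper name `build_encoder_input` are all assumptions made for the example.

```python
import numpy as np

# Hypothetical sizes and vocabularies, chosen only for illustration.
D_MODEL = 8
LANGS = ["auto", "zh", "en", "yue", "ja", "ko"]
ITN_MODES = ["with_itn", "without_itn"]

rng = np.random.default_rng(0)

# One embedding per prompt slot (random stand-ins for learned parameters).
lid_table = rng.normal(size=(len(LANGS), D_MODEL))      # language ID slot
ser_vec = rng.normal(size=(1, D_MODEL))                 # emotion slot
aed_vec = rng.normal(size=(1, D_MODEL))                 # audio event slot
itn_table = rng.normal(size=(len(ITN_MODES), D_MODEL))  # ITN on/off slot


def build_encoder_input(features, lang="auto", use_itn=True):
    """Prepend the four task embeddings to a (T, D_MODEL) feature sequence.

    Returns an array of shape (T + 4, D_MODEL) ordered as
    [LID, SER, AED, ITN, frame_1, ..., frame_T] (assumed order).
    """
    lid = lid_table[LANGS.index(lang)][None, :]
    itn = itn_table[0 if use_itn else 1][None, :]
    return np.concatenate([lid, ser_vec, aed_vec, itn, features], axis=0)


# Usage: 50 frames of stand-in acoustic features.
frames = rng.normal(size=(50, D_MODEL))
encoder_input = build_encoder_input(frames, lang="en", use_itn=True)
print(encoder_input.shape)  # (54, 8)
```

In this sketch the encoder sees four extra positions ahead of the audio frames, so a single non-autoregressive pass can produce the language, emotion, and event predictions at those positions alongside the transcript.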
 # Usage
