Add pipeline tag, library name and link to paper
This PR improves the model card by:
- making sure the model can be found at https://huggingface.co/models?pipeline_tag=feature-extraction
- making it clear that the `library_name` is `funasr`
- adding a link to the paper
README.md
CHANGED
@@ -1,8 +1,12 @@
 ---
 license: apache-2.0
+library_name: funasr
+pipeline_tag: feature-extraction
 ---
+
 # Step-Audio-Tokenizer
 
+This repository contains the tokenizer model described in the paper [Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction](https://arxiv.org/abs/2502.11946).
 
 Step-Audio LLM is the industry’s first 130-billion parameter hu-manlike unified end-to-end model that integrates multimodal speech un-derstanding and generation capabilities, including singing voice synthesis, tool utilization, role-play and multilingual/dialectal comprehension and synthesis.
 
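The metadata added by this change can be sanity-checked locally. The following is a minimal sketch, not part of the PR: `parse_front_matter` is a hypothetical helper that reads only flat `key: value` pairs between the leading `---` delimiters (it is not a full YAML parser), and `CARD` is a shortened copy of the updated README used for illustration.

```python
def parse_front_matter(card_text: str) -> dict[str, str]:
    """Return flat `key: value` pairs from a leading `---` block.

    Hypothetical helper for illustration; handles only simple
    one-line string values, not nested or multi-line YAML.
    """
    lines = card_text.splitlines()
    # Front matter must start on the very first line.
    if not lines or lines[0].strip() != "---":
        return {}
    metadata: dict[str, str] = {}
    for line in lines[1:]:
        if line.strip() == "---":
            # Closing delimiter reached: the block is complete.
            return metadata
        key, sep, value = line.partition(":")
        if sep:
            metadata[key.strip()] = value.strip()
    # No closing delimiter was found, so treat the block as invalid.
    return {}


# Shortened copy of the README after this change (illustrative only).
CARD = """---
license: apache-2.0
library_name: funasr
pipeline_tag: feature-extraction
---

# Step-Audio-Tokenizer
"""

metadata = parse_front_matter(CARD)
print(metadata)
```

A check like this only confirms that the fields are present in the file; whether the model then shows up under the `feature-extraction` filter is decided by the Hub.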