HKUST-Audio commited on
Commit
047e0b7
·
verified ·
1 Parent(s): c62ab72

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +19 -0
README.md ADDED
@@ -0,0 +1,19 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: cc-by-nc-4.0
3
+ language:
4
+ - zh
5
+ - en
6
+ base_model:
7
+ - meta-llama/Llama-3.2-1B-Instruct
8
+ tags:
9
+ - Text-to-Speech
10
+ pipeline_tag: text-to-speech
11
+ ---
12
+
13
+ [![arXiv](https://img.shields.io/badge/arXiv-Paper-<COLOR>.svg)](https://arxiv.org/abs/2502.04128)
14
+
15
+ **Update (2025-02-13):** Add [Llasa finetune instruction](https://github.com/zhenye234/LLaSA_training/tree/main/finetune).
16
+
17
+ These models are not mentioned in the original paper, they are essentially the same as LLaSA 1B and LLaSA 3B, except they have been fine-tuned with a mixed speech and text SFT dataset, which enables the model to retain text-based conversational abilities.
18
+
19
+ LLaSA: Scaling Train-Time and Inference-Time Compute for LLaMA-based Speech Synthesis