---
library_name: transformers
license: cc-by-4.0
base_model:
- HKUSTAudio/Llasa-1B
---

## Model Description

This is a 4-bit GPTQ quantization of Llasa-1B, a model released by the HKUSTAudio team.

I tested it with a script written by GitHub user nivibilla, linked below.

The tests ran successfully, but the generated voice is often of unusable quality.

In case you don't believe me, I'm leaving this model up so you can test it yourself.
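
For a rough sense of what 4-bit quantization buys, here is a back-of-the-envelope sketch of weight storage at 16 bits versus 4 bits. The parameter count of about 1e9 is an assumption taken from the "1B" in the model name, and `quantized_weight_bytes` is a hypothetical helper written for this illustration, not part of any library.

```python
def quantized_weight_bytes(num_params: int, bits_per_weight: int) -> int:
    """Bytes needed to store num_params weights at bits_per_weight bits each."""
    return num_params * bits_per_weight // 8


# Assumption: roughly 1e9 parameters, inferred from the "1B" in the model name.
NUM_PARAMS = 1_000_000_000

fp16_bytes = quantized_weight_bytes(NUM_PARAMS, 16)
int4_bytes = quantized_weight_bytes(NUM_PARAMS, 4)

print(f"16-bit weights: {fp16_bytes / 1e9:.1f} GB")  # 2.0 GB
print(f" 4-bit weights: {int4_bytes / 1e9:.1f} GB")  # 0.5 GB
print(f"reduction: {fp16_bytes / int4_bytes:.0f}x")  # 4x
```

This is an idealized lower bound. GPTQ checkpoints also store per-group scales and zero points, and typically leave some tensors such as embeddings unquantized, so the actual files will be larger than this estimate.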

## Model Sources

- **Repository:** HKUSTAudio/Llasa-1B
- **Paper:** LLaSA: Scaling Train-Time and Inference-Time Compute for LLaMA-based Speech Synthesis (Coming soon)
- **Test Script:** https://github.com/slives-lab/local-llasa-tts_voice/blob/main/llasa_vllm_longtext_inference.ipynb