Amphion Text-to-Audio Pretrained Models

We provide the following pretrained checkpoints for you to use, specifically:

Two AudioLDM pretrained checkpoints with corresponding AutoencoderKL checkpoints trained on AudioCaps.

Quick Start

To utilize the pretrained models, just run the following commands:

Step1: Download the checkpoint

git lfs install
git clone https://huggingface.co/amphion/text_to_audio

Step2: Clone the Amphion's Source Code of GitHub

git clone https://github.com/open-mmlab/Amphion.git

Step3: Specify the checkpoint's path

Use the soft link to specify the downloaded checkpoint in the first step:

cd Amphion
mkdir -p ckpts
ln -s  ../../../text_to_speech/tta  ckpts/

Step4: Inference

You can follow the inference part of this recipe to generate audio from text.

We also provided an online demo, feel free to try it!

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.