voice cloning?

#1
by krigeta - opened

voice cloning?

StepFun org

Hi, Step-Audio-2-mini-Base is the base model for Step-Audio-2-mini and it aims for end-to-end speech conversation.

However, the base model should also have some zero-shot voice cloning ability, by prefilling the prompt text-audio interleaving tokens and completing new tokens based on given text.

This usage is not included in our examples.py.

Sign up or log in to comment