voice cloning?

by krigeta - opened 4 days ago

Discussion

krigeta

4 days ago

voice cloning?

petronny

StepFun org 4 days ago

Hi, Step-Audio-2-mini-Base is the base model for Step-Audio-2-mini and it aims for end-to-end speech conversation.

However, the base model should also have some zero-shot voice cloning ability, by prefilling the prompt text-audio interleaving tokens and completing new tokens based on given text.

This usage is not included in our examples.py.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment