ZKong's Collections
audio

updated Jul 16, 2025

  • google-t5/t5-base

    Translation • 0.2B • Updated Feb 14, 2024 • 1.88M • 761

  • stabilityai/stable-audio-open-1.0

    Text-to-Audio • Updated Jun 19, 2025 • 23.3k • 1.37k

  • Kijai/MMAudio_safetensors

    Updated Dec 11, 2024 • 64

  • nvidia/bigvgan_v2_44khz_128band_512x

    Audio-to-Audio • Updated Sep 5, 2024 • 312k • 63

  • hexgrad/Kokoro-82M

    Text-to-Speech • Updated Apr 10, 2025 • 2.86M • 5.5k

  • mistralai/Voxtral-Mini-3B-2507

    5B • Updated Jul 28, 2025 • 456k • 602

  • mistralai/Voxtral-Small-24B-2507

    Audio-Text-to-Text • 24B • Updated 12 days ago • 14.2k • 441