
google/siglip-so400m-patch14-384
Zero-Shot Image Classification
•
Updated
•
10.5M
•
•
495
Generate captions for images in various styles
Generate depth maps from your images
Tuning-free subject-driven generation
A text-to-speech model powered by SparkAudio and Mobvoi.
Audio to Talking Face