docling-project/SmolDocling-256M-preview
Image-Text-to-Text
•
0.3B
•
Updated
•
19.2k
•
1.61k
Generate a talking-head video from an image and audio
Generate any application by Vibe Coding
Try on clothes on a person image
Blazingly Fast and Embarrassingly Simple Song Generation