
vikhyatk/moondream2
Image-Text-to-Text
β’
2B
β’
Updated
β’
185k
β’
1.28k
https://huggingface.co/papers/2501.03006
Detect and estimate human poses in images and videos
Generate text based on your input