Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
riddhimanrana
/
fastvlm-0.5b-captions
like
0
Image-Text-to-Text
Transformers
Core ML
Safetensors
MLX
riddhimanrana/coco-fastvlm-2k-val2017
English
llava_qwen2
text-generation
finetuned
4bit
multimodal
conversational
arxiv:
2412.13303
arxiv:
1910.09700
License:
apple-amlr
Model card
Files
Files and versions
xet
Community
Train
Deploy
Use this model
main
fastvlm-0.5b-captions
/
demo
Ctrl+K
Ctrl+K
1 contributor
History:
1 commit
riddhimanrana
Upload demo.gif
26c7ab9
verified
3 months ago
demo.gif
Safe
354 kB
xet
Upload demo.gif
3 months ago