riddhimanrana/fastvlm-0.5b-captions
Image-Text-to-Text
Transformers
Core ML
Safetensors
MLX
riddhimanrana/coco-fastvlm-2k-val2017
English
llava_qwen2
text-generation
finetuned
4bit
multimodal
conversational
arxiv:2412.13303
arxiv:1910.09700
License: apple-amlr
main
fastvlm-0.5b-captions
1 contributor
History: 19 commits
Latest commit: riddhimanrana, "Update README.md" (ce37ded, verified), 2 days ago
File                          Size       Scan  Storage  Last commit       Updated
demo                          -          -     -        Upload demo.gif   2 months ago
fastvithd.mlpackage           -          -     -        Upload 3 files    2 months ago
.gitattributes                1.67 kB    Safe  -        Upload demo.gif   2 months ago
README.md                     9.92 kB    -     -        Update README.md  2 days ago
added_tokens.json             101 Bytes  Safe  -        Upload model      2 months ago
config.json                   1.45 kB    Safe  -        Upload model      2 months ago
merges.txt                    1.67 MB    Safe  -        Upload model      2 months ago
model.safetensors             357 MB     Safe  xet      Upload model      2 months ago
model.safetensors.index.json  54.4 kB    Safe  -        Upload model      2 months ago
predict.py                    3.68 kB    Safe  -        Create predict.py 2 months ago
preprocessor_config.json      467 Bytes  Safe  -        Upload model      2 months ago
processor_config.json         168 Bytes  Safe  -        Upload model      2 months ago
special_tokens_map.json       367 Bytes  Safe  -        Upload model      2 months ago
tokenizer.json                11.4 MB    Safe  xet      Upload model      2 months ago
tokenizer_config.json         1.64 kB    Safe  -        Upload model      2 months ago
vocab.json                    2.78 MB    Safe  -        Upload model      2 months ago