Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
amete7
/
qvla
like
0
Image-Text-to-Text
Transformers
Safetensors
English
molmo
text-generation
multimodal
olmo
pixmo
conversational
custom_code
arxiv:
2409.17146
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
Use this model
main
qvla
/
preprocessing_molmo.py
Commit History
vla added but giving nans in loss
57b4d23
Atharva Mete
commited on
Jan 10
original molmo
303e3cf
Atharva Mete
commited on
Jan 7