llavallava/qwen2vl2b-instruct-trl-dpo-0_0.1_epochs1_nonref
Image-to-Text
Transformers
TensorBoard
Safetensors
qwen2_vl
Generated from Trainer
trl
dpo
text-generation-inference
arxiv:2305.18290
main
qwen2vl2b-instruct-trl-dpo-0_0.1_epochs1_nonref/runs
1 contributor
History: 32 commits
llavallava
Model save
bf21a96
verified
7 months ago
Jan30_13-28-11_csr-95830.utdallas.edu
Model save
7 months ago