llavallava
/
qwen2vl2b-instruct-trl-dpo-0_0.1_epochs1_nonref

Model card Files Files and versions Metrics Training metrics Community