llavallava
/
smolvlm-instruct-trl-dpo-rlaif-v

Model card Files Files and versions Metrics Training metrics Community