metadata
library_name: transformers
pipeline_tag: image-text-to-text
This repository contains the HandsOnVLM model presented in the paper HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction.
Project page: https://www.chenbao.tech/handsonvlm/ Code: https://github.com/Kami-code/HandsOnVLM-release