--- library_name: transformers pipeline_tag: image-text-to-text --- This repository contains the model weights for the paper [HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction](https://huggingface.co/papers/2412.13187). Code: https://github.com/Kami-code/HandsOnVLM-release