Kami-code/handsonvlm-7b

metadata

library_name: transformers
pipeline_tag: image-text-to-text

This repository contains the HandsOnVLM model presented in the paper "HandsOnVLM: Vision-Language Models for Hand-Object Interaction Prediction".

Project page: https://www.chenbao.tech/handsonvlm/

Code: https://github.com/Kami-code/HandsOnVLM-release
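
This card does not describe how to obtain or load the weights. The following is a minimal sketch that only fetches the repository files, assuming the `huggingface_hub` package is installed and that the repository id is `Kami-code/handsonvlm-7b` as shown on this page. The helper name `download_checkpoint` is hypothetical. Model construction and inference are presumably handled by the code in the GitHub repository above and are not attempted here.

```python
from pathlib import Path
from typing import Optional

# Repository id as it appears on this page.
REPO_ID = "Kami-code/handsonvlm-7b"


def download_checkpoint(local_dir: Optional[str] = None) -> Path:
    """Download the files of the model repository and return their local path.

    Hypothetical helper for illustration; it is not part of the
    HandsOnVLM code base.
    """
    # Imported lazily so this module can be imported without the dependency.
    from huggingface_hub import snapshot_download

    # With local_dir=None the files go to the default Hugging Face cache.
    path = snapshot_download(repo_id=REPO_ID, local_dir=local_dir)
    return Path(path)
```

Calling `download_checkpoint()` needs network access and, for a checkpoint of this size, is likely to transfer a large amount of data. The returned directory can then be passed to whatever loading routine the released code provides.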