Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
johannhartmann
's Collections
GUI Intelligence
Document & UI Intelligence
Multimodal Models
Medical MultiModal
GUI Intelligence
updated
11 days ago
Upvote
-
bytedance-research/UI-TARS-72B-DPO
Image-Text-to-Text
•
Updated
13 days ago
•
9.96k
•
77
bytedance-research/UI-TARS-7B-DPO
Image-Text-to-Text
•
Updated
13 days ago
•
25.8k
•
122
microsoft/OmniParser
Image-Text-to-Text
•
Updated
Dec 2, 2024
•
2.02k
•
1.55k
jadechoghari/Ferret-UI-Llama8b
Image-Text-to-Text
•
Updated
30 days ago
•
375
•
65
Upvote
-
Share collection
View history
Collection guide
Browse collections