InvokeAI/ip_adapter_sd_image_encoder
A tiny vision language model
Generate text descriptions from images
Analyze image to generate descriptive prompt
Meta Llama 3 8B with LLaVA multimodal capabilities
Display a user interface for various tasks
Convert GUI screen to structured elements
Generate detailed image descriptions for prompts
Generate detailed image descriptions
Upload images and get detailed descriptions