Edit Models filters

Tasks

Text Generation

Image-Text-to-Text

Parameters

Libraries

Transformers.js

Apps

Inference Providers

Models

8,167

Full-text search

Active filters: image-text-to-text

baidu/ERNIE-4.5-VL-28B-A3B-Thinking

Image-Text-to-Text • 30B • Updated about 3 hours ago • 1.52k • 297

deepseek-ai/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated 8 days ago • 3.58M • • 2.64k

PaddlePaddle/PaddleOCR-VL

Image-Text-to-Text • 1.0B • Updated about 23 hours ago • 38.6k • 1.27k

jzhang533/PaddleOCR-VL-For-Manga

Image-Text-to-Text • 1.0B • Updated about 5 hours ago • 113 • 73

Qwen/Qwen3-VL-8B-Instruct

Image-Text-to-Text • 9B • Updated 28 days ago • 1.66M • • 423

Qwen/Qwen3-VL-30B-A3B-Instruct

Image-Text-to-Text • 31B • Updated Oct 9 • 2.27M • • 379

google/gemma-3-4b-it

Image-Text-to-Text • 4B • Updated Mar 21 • 935k • 950

ibm-granite/granite-docling-258M

Image-Text-to-Text • 0.3B • Updated Sep 23 • 103k • 1.01k

Qwen/Qwen3-VL-2B-Instruct

Image-Text-to-Text • 2B • Updated 20 days ago • 262k • 177

unsloth/DeepSeek-OCR

Image-Text-to-Text • 3B • Updated 3 days ago • 6.46k • 25

QuixiAI/Prisma-VL-8B

Image-Text-to-Text • 770k • Updated 1 day ago • 302 • 11

rednote-hilab/dots.ocr

Image-Text-to-Text • 3B • Updated 12 days ago • 1.07M • 1.12k

nanonets/Nanonets-OCR2-3B

Image-Text-to-Text • 4B • Updated 27 days ago • 92.1k • 441

huihui-ai/Huihui-Qwen3-VL-8B-Instruct-abliterated

Image-Text-to-Text • 9B • Updated 11 days ago • 30.1k • 70

mlx-community/DeepSeek-OCR-8bit

Image-Text-to-Text • 1B • Updated 16 days ago • 6.58k • 20

Qwen/Qwen2.5-VL-7B-Instruct

Image-Text-to-Text • 8B • Updated Apr 6 • 5M • • 1.34k

google/gemma-3-27b-it

Image-Text-to-Text • 27B • Updated Mar 21 • 837k • • 1.68k

google/medgemma-4b-it

Image-Text-to-Text • 4B • Updated 15 days ago • 473k • 748

zai-org/GLM-4.5V

Image-Text-to-Text • 108B • Updated 18 days ago • 43.3k • • 689

nvidia/NVIDIA-Nemotron-Nano-12B-v2-VL-BF16

Image-Text-to-Text • 13B • Updated 2 days ago • 11.7k • 48

Qwen/Qwen3-VL-8B-Instruct-GGUF

Image-Text-to-Text • 8B • Updated 11 days ago • 6.48k • 10

mlabonne/gemma-3-27b-it-abliterated

Image-Text-to-Text • 27B • Updated Mar 21 • 5.09k • • 232

unsloth/Qwen2.5-VL-7B-Instruct-GGUF

Image-Text-to-Text • 8B • Updated May 12 • 82.5k • 98

Qwen/Qwen3-VL-4B-Instruct

Image-Text-to-Text • 4B • Updated 28 days ago • 595k • 227

ServiceNow/GroundNext-7B-V0

Image-Text-to-Text • 8B • Updated about 12 hours ago • 60 • 7

unsloth/Qwen3-VL-30B-A3B-Instruct-GGUF

Image-Text-to-Text • 31B • Updated 1 day ago • 48.6k • 27

google/gemma-3n-E4B-it

Image-Text-to-Text • 8B • Updated Jul 14 • 42k • 815

Qwen/Qwen3-VL-32B-Instruct

Image-Text-to-Text • 33B • Updated 22 days ago • 1.18M • 117

Qwen/Qwen3-VL-8B-Thinking-GGUF

Image-Text-to-Text • 8B • Updated 11 days ago • 3.5k • 9

google/paligemma-3b-pt-224

Image-Text-to-Text • 3B • Updated Sep 21, 2024 • 36.5k • 371