Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Main
Tasks
1
Libraries
Languages
Licenses
Other
Tasks
Reset Tasks
Text Generation
Any-to-Any
Image-Text-to-Text
Image-to-Text
Image-to-Image
Text-to-Image
Text-to-Video
Text-to-Speech
+ 44
Parameters
Reset Parameters
< 1B
6B
12B
32B
128B
> 500B
< 1B
> 500B
Libraries
PyTorch
google-tensorflow
TensorFlow
JAX
Transformers
Diffusers
sentence-transformers
Safetensors
ONNX
GGUF
Transformers.js
MLX
+ 41
Apps
vLLM
TGI
llama.cpp
MLX LM
LM Studio
Ollama
Jan
+ 7
Inference Providers
Groq
Novita
Nebius AI
Cerebras
SambaNova
Nscale
fal
Hyperbolic
+ 11
Apply filters
Models
9,961
Full-text search
Inference Available
Edit filters
Sort: Trending
Active filters:
image-to-text
Clear all
datalab-to/chandra
Image-to-Text
•
9B
•
Updated
Oct 21, 2025
•
332k
•
451
Salesforce/blip-image-captioning-base
Image-to-Text
•
Updated
Feb 3, 2025
•
1.61M
•
833
allenai/olmOCR-2-7B-1025
Image-to-Text
•
8B
•
Updated
Oct 22, 2025
•
62.1k
•
120
allenai/olmOCR-2-7B-1025-FP8
Image-to-Text
•
8B
•
Updated
29 days ago
•
1.36M
•
172
nvidia/nemotron-ocr-v1
Image-to-Text
•
Updated
22 days ago
•
170
•
58
monkt/paddleocr-onnx
Image-to-Text
•
Updated
Oct 7, 2025
•
30
sugartai/Qwen3-VL-4B-Uni-MuMER-Final
Image-to-Text
•
4B
•
Updated
2 days ago
•
6
•
3
nlpconnect/vit-gpt2-image-captioning
Image-to-Text
•
Updated
Feb 27, 2023
•
1.12M
•
923
facebook/nougat-base
Image-to-Text
•
0.3B
•
Updated
Nov 20, 2023
•
6.01k
•
182
LanguageBind/Video-LLaVA-7B-hf
Image-to-Text
•
7B
•
Updated
May 16, 2024
•
7.99k
•
47
VLM2Vec/VLM2Vec-V2.0
Image-to-Text
•
Updated
Jul 13, 2025
•
9.72k
•
24
PaddlePaddle/PP-OCRv5_server_det
Image-to-Text
•
Updated
Jul 22, 2025
•
317k
•
52
XiaomiMiMo/MiMo-Embodied-7B
Image-to-Text
•
8B
•
Updated
Nov 21, 2025
•
375
•
58
kha-white/manga-ocr-base
Image-to-Text
•
Updated
Jun 22, 2022
•
178k
•
163
microsoft/trocr-base-printed
Image-to-Text
•
0.3B
•
Updated
May 27, 2024
•
238k
•
201
microsoft/trocr-large-handwritten
Image-to-Text
•
Updated
May 27, 2024
•
18.5k
•
133
microsoft/trocr-large-printed
Image-to-Text
•
0.6B
•
Updated
May 27, 2024
•
128k
•
178
naver-clova-ix/donut-base
Image-to-Text
•
Updated
Aug 13, 2022
•
175k
•
241
microsoft/git-base-coco
Image-to-Text
•
Updated
Feb 8, 2023
•
55.1k
•
20
Salesforce/blip-image-captioning-large
Image-to-Text
•
0.5B
•
Updated
Feb 3, 2025
•
1.55M
•
1.44k
Salesforce/blip2-opt-2.7b-coco
Image-to-Text
•
4B
•
Updated
Feb 3, 2025
•
314k
•
11
Xenova/vit-gpt2-image-captioning
Image-to-Text
•
Updated
Oct 8, 2024
•
5.16k
•
27
bidiptas/PG-InstructBLIP
Image-to-Text
•
Updated
Jan 22, 2024
•
16
hezarai/crnn-base-fa-v1
Image-to-Text
•
Updated
Apr 14, 2025
•
37
•
5
microsoft/kosmos-2-patch14-224
Image-to-Text
•
2B
•
Updated
Nov 28, 2023
•
152k
•
182
OleehyO/TexTeller
Image-to-Text
•
0.3B
•
Updated
Jun 22, 2024
•
140k
•
41
breezedeus/pix2text-mfr
Image-to-Text
•
Updated
May 5, 2024
•
109k
•
50
Mozilla/distilvit
Image-to-Text
•
0.2B
•
Updated
Nov 25, 2024
•
123
•
25
unsloth/Llama-3.2-11B-Vision-Instruct
Image-to-Text
•
11B
•
Updated
Dec 10, 2024
•
24.9k
•
87
unsloth/Llama-3.2-90B-Vision
Image-to-Text
•
89B
•
Updated
Jun 3, 2025
•
33
•
4
Previous
1
2
3
...
100
Next