Models: 40

Active filters: vision-language-model

ByteDance/Dolphin

Image-Text-to-Text • 0.4B • Updated Jul 16 • 87.7k • 472

remyxai/SpaceLLaVA

Image-Text-to-Text • 13B • Updated Apr 20 • 153 • 25

airtightsquid/TA_GLM4.5_lora_finetune

Image Classification • Updated 16 days ago • 5.1k • 1

InternRobotics/InternVLA-N1

Robotics • 8B • Updated 1 day ago • 68 • 1

deadzzz/qwen_VLM_finetuning

Updated Oct 24, 2024

xiaorui638/flair

Updated Mar 6 • 3 • 5

wingrune/3DGraphLLM

Image-Text-to-Text • Updated 18 days ago

SVECTOR-CORPORATION/Spec-Vision-V1

Image-Text-to-Text • 4B • Updated Feb 11 • 11 • 4

Duino/Duino-Lidar

Depth Estimation • Updated Feb 18 • 7

sankim2/cosmos

Image-Text-to-Text • Updated Mar 27 • 2 • 1

yjj23/minivlm

Updated Apr 20 • 7

samihalawa/APOLO-medical-multimodal-instruct

Image-Text-to-Text • Updated May 8 • 2

daniel3303/QwenStoryteller

Image-to-Text • 8B • Updated May 16 • 68 • 8

mradermacher/QwenStoryteller-GGUF

Image-to-Text • 8B • Updated Jul 31 • 165

mradermacher/QwenStoryteller-i1-GGUF

Image-to-Text • 8B • Updated Jul 11 • 220 • 1

lordChipotle/nutrition-label-detector

Image-Text-to-Text • 5B • Updated May 19 • 82

truworthai/DynamicVisualLearning-v2-mlx

truworthai/FixedDynamicLearning-v3-mlx

truworthai/FinalVisualLearning-v4-mlx

truworthai/verynew

truworthai/testhellow

truworthai/Combined-mlx

Updated Jun 3 • 3 • 1

humbleakh/qwen2.5-vl-3b-8bit-chain-of-zoom

Image-to-Text • Updated Jun 8 • 9 • 1

phronetic-ai/owlet-har-1

Video Classification • 4B • Updated Jun 23 • 282

convaiinnovations/ECG-Instruct-Llama-3.2-11B-Vision

Text Generation • 11B • Updated Jun 19 • 20

gribok201/smolvla

Robotics • Updated Jun 19 • 3

daniel3303/QwenStoryteller2

Image-to-Text • 8B • Updated Jul 1 • 27 • 2

mradermacher/QwenStoryteller2-GGUF

Image-to-Text • 8B • Updated Jul 31 • 204

mradermacher/QwenStoryteller2-i1-GGUF

Image-to-Text • 8B • Updated Jul 11 • 146

luquiT4/DolphinInference

Image-Text-to-Text • 0.4B • Updated Jul 4 • 2