HuggingFaceM4

company

Recent Activity

lewtun submitted a paper about 2 months ago

Single-minus gluon tree amplitudes are nonzero

lewtun submitted a paper about 2 months ago

Reasoning Cache: Continual Improvement Over Long Horizons via Short-Horizon RL

andito updated a Space about 2 months ago

HuggingFaceM4/reachy_mini_remote_control

Organization Card

HuggingFaceM4 is the multimodal team at Hugging Face, working on vision-language models.

Within this organization on the Hugging Face Hub, you can find the Idefics models (version 1: IDEFICS, version 2: Idefics2, version 3: Idefics3), the datasets used to train them, such as OBELICS, WebSight, The Cauldron, and Docmatix, and interactive tools for visualizing the results.
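
As a rough illustration of how these resources are addressed on the Hub, the sketch below builds page URLs for repositories in this organization and optionally lists its models. The helper names `repo_url` and `list_org_models` are hypothetical and introduced only for this example. The listing function assumes the `huggingface_hub` package is installed and that network access is available.

```python
ORG = "HuggingFaceM4"

# URL path prefixes used by the Hub for each repository type.
_PREFIXES = {"model": "", "dataset": "datasets/", "space": "spaces/"}


def repo_url(name: str, repo_type: str = "model") -> str:
    """Return the Hub page URL for a repo in the organization (hypothetical helper)."""
    if repo_type not in _PREFIXES:
        raise ValueError(f"unknown repo_type: {repo_type!r}")
    return f"https://huggingface.co/{_PREFIXES[repo_type]}{ORG}/{name}"


def list_org_models(limit: int = 10) -> list:
    """List model ids published by the organization.

    Assumption: requires `pip install huggingface_hub` and network access.
    The import is kept local so the URL helper works without the package.
    """
    from huggingface_hub import list_models

    return [model.id for model in list_models(author=ORG, limit=limit)]


if __name__ == "__main__":
    print(repo_url("idefics2-8b"))
    print(repo_url("the_cauldron", repo_type="dataset"))
```

Models sit directly under the organization name, while datasets and Spaces carry a `datasets/` or `spaces/` prefix in the URL.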

Collections 5

Spaces 20

IDEFICS Playground

faster-qwen3-tts

Generate speech audio from text with custom or cloned voices

Reachy Mini Remote Control (Multi-User)

Remote control for Reachy Mini robots with authentication

Reachy Mini Key Claim

Request an ephemeral API key using an order number

Gradium Setup

Little space to improve the onboarding to gradium

FineVision: Open Data is All You Need

A new open-source dataset for training VLMs

Models 34

HuggingFaceM4/Idefics3-8B-Llama3

Image-Text-to-Text • Updated Dec 2, 2024 • 170k • 302

HuggingFaceM4/Florence-2-DocVQA

Image-Text-to-Text • 0.8B • Updated Oct 30, 2024 • 766 • 65

HuggingFaceM4/idefics2-8b

Image-Text-to-Text • 8B • Updated Oct 14, 2024 • 153k • 621

HuggingFaceM4/idefics2-8b-base

Image-Text-to-Text • 8B • Updated Jul 30, 2024 • 1.34k • 28

HuggingFaceM4/idefics2-8b-chatty

Image-Text-to-Text • 8B • Updated Jul 30, 2024 • 57 • 95

HuggingFaceM4/siglip-so400m-14-364-flash-attn2-navit

Zero-Shot Image Classification • 0.9B • Updated Jul 27, 2024 • 12 • 1

HuggingFaceM4/siglip-so400m-14-700-flash-attn2-navit

Zero-Shot Image Classification • 0.9B • Updated Jun 13, 2024 • 10 • 2

HuggingFaceM4/siglip-so400m-14-384-flash-attn2-navit

Zero-Shot Image Classification • 0.9B • Updated May 9, 2024 • 12 • 1

HuggingFaceM4/idefics2-8b-chatty-AWQ

Image-Text-to-Text • 8B • Updated May 6, 2024 • 18 • 5

HuggingFaceM4/idefics2-8b-AWQ

Image-Text-to-Text • 8B • Updated May 6, 2024 • 19 • 26

Datasets 82

HuggingFaceM4/FineVisionMax

Viewer • Updated Oct 21, 2025 • 24.2M • 27.8k • 22

HuggingFaceM4/FineVision

Viewer • Updated Oct 21, 2025 • 24.2M • 140k • 479

HuggingFaceM4/lmms-eval-embeddings

Updated Sep 3, 2025 • 320 • 1

HuggingFaceM4/DoclingMatix

Viewer • Updated Jul 31, 2025 • 1.27M • 1.05k • 50

HuggingFaceM4/Caltech-101

Updated Sep 10, 2024 • 238 • 4

HuggingFaceM4/Docmatix

Viewer • Updated Aug 26, 2024 • 2.55M • 8.8k • 300

HuggingFaceM4/the_cauldron

Viewer • Updated May 6, 2024 • 1.88M • 56.5k • 522

HuggingFaceM4/FairFace

Viewer • Updated Apr 11, 2024 • 195k • 1.22k • 29

HuggingFaceM4/MMBench

Viewer • Updated Apr 5, 2024 • 11k • 4.88k • 4

HuggingFaceM4/WebSight

Viewer • Updated Mar 26, 2024 • 2.75M • 19.5k • 389