Blog, Articles, and discussions

mmBERT: ModernBERT goes Multilingual

By September 9, 2025 • 100

Community Articles

view all

Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips

•

6 days ago

• 8

Code a simple RAG from scratch

•

Oct 29, 2024

• 204

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

and 5 others •

20 days ago

• 102

Ground-up efforts to build large datasets for effective and accurate translation of Modi-Script documents into modern Marathi

and 1 other •

5 days ago

• 6

Nemotron-Personas-Japan: ソブリン AI のための合成データセット

and 6 others •

4 days ago

• 6

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 682

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 225

🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎

and 1 other •

13 days ago

• 12

Preserving Agency: Why AI Safety Needs Community, Not Corporate Control

•

about 20 hours ago

• 5

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

•

Feb 11

• 70

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 71

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 74

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

•

Jun 26

• 47

PrediBench: Testing AI models on prediction markets

and 1 other •

6 days ago

• 4

Mastering Tensor Dimensions in Transformers

•

Jan 12

• 98

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

•

Apr 16

• 43

Federated Learning using Hugging Face and Flower

By March 27, 2023 guest

Welcome PaddlePaddle to the Hugging Face Hub

By January 17, 2023 guest • 3

From GPT2 to Stable Diffusion: Hugging Face arrives to the Elixir community

By December 9, 2022

From PyTorch DDP to 🤗 Accelerate to 🤗 Trainer, mastery of distributed training with ease

By October 21, 2022 • 38

Optimization story: Bloom inference

By October 12, 2022 • 7

How 🤗 Accelerate runs very large models thanks to PyTorch

By September 27, 2022 • 14

Introducing Skops

By August 12, 2022 • 1

Introducing The World's Largest Open Multilingual Language Model: BLOOM

By July 12, 2022 • 5

Gradio 3.0 is Out!

By May 16, 2022

Welcome fastai to the Hugging Face Hub

By May 6, 2022 • 2

Introducing Decision Transformers on Hugging Face 🤗

By March 28, 2022 • 7

Welcome Stable-baselines3 to the Hugging Face Hub 🤗

By January 21, 2022

Gradio joins Hugging Face!

By December 21, 2021 • 6

Welcome spaCy to the 🤗 Hub

By July 13, 2021 • 1

Community Articles

There is no such thing as a tokenizer-free lunch

•

5 days ago

• 61

Nemotron-Personas-Japan: Synthesized Data for Sovereign AI

and 6 others •

6 days ago

• 22

RexBERT: Encoders for a brave new world of E-Commerce

and 1 other •

9 days ago

• 44

Model Quality: Hugging Face Is All You Need

•

4 days ago

• 15

Qianfan-VL: A Milestone Achievement in Chinese Multimodal AI with Domestic Chips

•

6 days ago

• 8

Code a simple RAG from scratch

•

Oct 29, 2024

• 204

PP-OCRv5 on Hugging Face: A Specialized Approach to OCR

and 5 others •

20 days ago

• 102

Ground-up efforts to build large datasets for effective and accurate translation of Modi-Script documents into modern Marathi

and 1 other •

5 days ago

• 6

Nemotron-Personas-Japan: ソブリン AI のための合成データセット

and 6 others •

4 days ago

• 6

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 682

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 225

🌎 What kind of environmental impacts are AI companies disclosing? (And can we compare them?) 🌎

and 1 other •

13 days ago

• 12

Preserving Agency: Why AI Safety Needs Community, Not Corporate Control

•

about 20 hours ago

• 5

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

•

Feb 11

• 70

Navigating the RLHF Landscape: From Policy Gradients to PPO, GAE, and DPO for LLM Alignment

•

Feb 11

• 71

Small Language Models (SLM): A Comprehensive Overview

•

Feb 22

• 74

Understanding Gemma 3n: How MatFormer Gives You Many Models in One

•

Jun 26

• 47

PrediBench: Testing AI models on prediction markets

and 1 other •

6 days ago

• 4

Mastering Tensor Dimensions in Transformers

•

Jan 12

• 98

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

•

Apr 16

• 43

View all