13 15 19

Garreth Lee

garrethlee

AI & ML interests

None yet

Recent Activity

liked a Space 5 months ago

HuggingFaceTB/smol-training-playbook

liked a dataset 7 months ago

HuggingFaceM4/FineVision

liked a model 7 months ago

google/embeddinggemma-300m

View all activity

Organizations

liked a Space 5 months ago

The Smol Training Playbook

📚

3.05k

The secrets to building world-class LLMs

liked a dataset 7 months ago

HuggingFaceM4/FineVision

Viewer • Updated Oct 21, 2025 • 24.2M • 126k • 474

liked a model 7 months ago

google/embeddinggemma-300m

liked a dataset 7 months ago

nvidia/Granary

Viewer • Updated 7 days ago • 141M • 4.02k • 189

upvoted 2 papers 9 months ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26, 2025 • 77

OWSM v4: Improving Open Whisper-Style Speech Models via Data Scaling and Cleaning

Paper • 2506.00338 • Published May 31, 2025 • 10

upvoted a changelog 10 months ago

Hugging Face Changelog

Xet is now the default storage option for new users and organizations

May 23, 2025

• 76

liked a Space 11 months ago

Dia 1.6B

👯

1.75k

Generate realistic dialogue from a script, using Dia!

upvoted a collection 12 months ago

Llama 4

Collection

Llama 4 release • 13 items • Updated Apr 29, 2025 • 710

upvoted an article 12 months ago

Article

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

Mar 24, 2025

•

upvoted an article about 1 year ago

Article

FastRTC: The Real-Time Communication Library for Python

Feb 25, 2025

•

172

liked a Space about 1 year ago

The Ultra-Scale Playbook

🌌

3.74k

The ultimate guide to training LLM on large GPU Clusters

upvoted 3 articles about 1 year ago

Article

1 Billion Classifications

Feb 13, 2025

•

Article

KV Caching Explained: Optimizing Transformer Inference Efficiency

Jan 30, 2025

•

245

Article

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

Jan 29, 2025

•

liked a model about 1 year ago

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 1.43M • • 13.1k

upvoted a paper about 1 year ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 377

updated a Space over 1 year ago

Number Tokenization Blog

📈

115

Explore how tokenization affects arithmetic in LLMs

liked a dataset over 1 year ago

HuggingFaceFW/fineweb-2

Viewer • Updated Oct 27, 2025 • 4.48B • 66.1k • 768

liked a Space over 1 year ago

Number Tokenization Blog

📈

115

Explore how tokenization affects arithmetic in LLMs

Garreth Lee

AI & ML interests

Recent Activity

Organizations

garrethlee's activity

The Smol Training Playbook

Xet is now the default storage option for new users and organizations

Dia 1.6B

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

FastRTC: The Real-Time Communication Library for Python

The Ultra-Scale Playbook

1 Billion Classifications

KV Caching Explained: Optimizing Transformer Inference Efficiency

How biased is Whisper ? Evaluating Whisper Models for Robustness to Diverse English Accents

Number Tokenization Blog

Number Tokenization Blog