Thomas Bouvier

tbouvier

https://thomas-bouvier.io

AI & ML interests

HPC for ML, large-scale pretraining, AI4Science

Recent Activity

liked a dataset about 2 months ago

LEAP/ClimSim_high-res

upvoted an article about 2 months ago

Finally, a Replacement for BERT: Introducing ModernBERT

liked a dataset 3 months ago

mcherukara/PtychoNN_data

View all activity

Organizations

None yet

liked a dataset about 2 months ago

LEAP/ClimSim_high-res

Updated Sep 29, 2023 • 1.82k • 12

upvoted an article about 2 months ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

and 14 others •

Dec 19, 2024

• 680

liked a dataset 3 months ago

mcherukara/PtychoNN_data

Updated Mar 18 • 11 • 1

liked 2 models 4 months ago

allenai/ACE2-ERA5

Updated Jul 16 • 22 • 6

microsoft/aurora

Updated Jun 20 • 41

upvoted an article 5 months ago

Article

Efficient LLM Pretraining: Packed Sequences and Masked Attention

•

Oct 7, 2024

• 48

liked 3 Spaces 6 months ago

Memory Viz

🧠

Memory Viz

Predict Memory

🧮

Calculate memory usage for model configurations

3.14k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted an article 7 months ago

Article

Open-R1: Update #1

and 7 others •

Feb 2

• 305

liked 2 datasets 7 months ago

PleIAs/common_corpus

Viewer • Updated Jun 10 • 470M • 14.7k • 305

HuggingFaceFW/fineweb-edu

Viewer • Updated Jul 11 • 3.5B • 116k • 741

liked 2 models 7 months ago

mistralai/Mistral-Small-24B-Base-2501

24B • Updated Jul 28 • 22.4k • 257

meta-llama/Llama-3.2-3B

Text Generation • 3B • Updated Oct 24, 2024 • 549k • 625

liked a model 8 months ago

deepseek-ai/DeepSeek-V3

Text Generation • 685B • Updated Mar 27 • 375k • • 3.96k

upvoted a collection 8 months ago

ModernBERT

Collection

Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 150

liked a model 8 months ago

answerdotai/ModernBERT-base

Fill-Mask • 0.1B • Updated Jan 15 • 898k • 924

liked 2 Spaces 8 months ago

TheWell

🌍

Visualization of data from the Well

1.06k

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked a model 8 months ago

deepseek-ai/DeepSeek-V3-Base

685B • Updated Mar 27 • 45.5k • 1.67k

Thomas Bouvier

AI & ML interests

Recent Activity

Organizations

tbouvier's activity

Finally, a Replacement for BERT: Introducing ModernBERT

Efficient LLM Pretraining: Packed Sequences and Masked Attention

Memory Viz

Predict Memory

The Ultra-Scale Playbook

Open-R1: Update #1

TheWell

FineWeb: decanting the web for the finest text data at scale