Stephen Oates PRO

soates

AI & ML interests

None yet

Recent Activity

liked a Space 4 days ago

nanotron/ultrascale-playbook

upvoted an article 22 days ago

Open-R1: Update #1

upvoted an article 27 days ago

Open-R1: a fully open reproduction of DeepSeek-R1

View all activity

Organizations

None yet

soates's activity

liked a Space 4 days ago

1.4k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 2 months ago

Datou1111/shou_xin

Text-to-Image • Updated Dec 9, 2024 • 2.18k • 863

liked a model 5 months ago

lamm-mit/LifeGPT

Updated Sep 19, 2024 • 8

liked a Space 6 months ago

112

Open-LLM performances are plateauing, let’s make the leaderboard steep again

🏔

Update leaderboard for fair model evaluation

liked a model 7 months ago

nisten/Biggie-SmoLlm-0.15B-Base

Text Generation • Updated Aug 7, 2024 • 923 • • 233

liked a Space 7 months ago

Gpt2 Multiplication Predictor

📈

Multiply large numbers using different reasoning methods

liked a Space 9 months ago

774

FineWeb: decanting the web for the finest text data at scale

🍷

Generate high-quality web text data for LLM training

liked a Space 10 months ago

270

Phi-3 WebGPU

🚀

A private and powerful AI that runs locally in your browser

liked a model 10 months ago

rombodawg/test_dataset_Codellama-3-8B

Text Generation • Updated May 4, 2024 • 140 • 78