Lewis Tunstall's picture

Lewis Tunstall PRO

lewtun

·

https://lewtun.github.io/blog/

AI & ML interests

LLMs, LLMs, LLMs

Articles

Open-R1: a fully open reproduction of DeepSeek-R1

Universal Assisted Generation: Faster Decoding with Any Assistant Model

Faster Assisted Generation with Dynamic Speculation

Llama can now see and run on your device - welcome Llama 3.2

FineVideo: behind the scenes

How NuminaMath Won the 1st AIMO Progress Prize

Welcome Gemma 2 - Google's new open LLM

Constitutional AI with Open LLMs

Preference Tuning LLMs with Direct Preference Optimization Methods

Mixture of Experts Explained

Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face

SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit

Fine-tuning Llama 2 70B using PyTorch FSDP

Code Llama: Llama 2 learns to code

Llama 2 is here - get it on Hugging Face

Can foundation models label data like humans?

The Falcon has landed in the Hugging Face ecosystem

Creating a Coding Assistant with StarCoder

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Red-Teaming Large Language Models

Diffusion Models Live Event

Very Large Language Models and How to Evaluate Them

SetFit: Efficient Few-Shot Learning Without Prompts

Announcing Evaluation on the Hub

Organizations

lewtun's activity

liked a model about 9 hours ago

mistralai/Mistral-Small-24B-Instruct-2501

Text Generation • Updated about 7 hours ago • 289

liked a model about 10 hours ago

mistralai/Mistral-Small-24B-Base-2501

Text Generation • Updated about 8 hours ago • 120

liked a dataset about 12 hours ago

open-r1/OpenThoughts-114k-math

Viewer • Updated about 14 hours ago • 89.1k • 13 • 10

liked a dataset about 16 hours ago

cognitivecomputations/dolphin-r1

Viewer • Updated about 6 hours ago • 814k • 20 • 68

liked 2 datasets 2 days ago

ServiceNow-AI/R1-Distill-SFT

Viewer • Updated 2 days ago • 1.85M • 317 • 94

open-thoughts/OpenThoughts-114k

Viewer • Updated 1 day ago • 114k • 5.18k • 148

liked 4 models 7 days ago

RLHFlow/Llama3.1-8B-PRM-Deepseek-Data

Text Generation • Updated Nov 9, 2024 • 21.7k • 33

meta-llama/Llama-3.2-1B-Instruct

Text Generation • Updated Oct 24, 2024 • 1.32M • 728

HuggingFaceTB/SmolVLM-256M-Instruct

Image-Text-to-Text • Updated 8 days ago • 11.7k • 110

HuggingFaceTB/SmolVLM-500M-Instruct

Image-Text-to-Text • Updated 8 days ago • 7.72k • 82

liked 2 models 9 days ago

deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B

Text Generation • Updated 5 days ago • 225k • 570

deepseek-ai/DeepSeek-R1

Text Generation • Updated 5 days ago • 498k • 5.31k

liked a Space 24 days ago

MEGA-Bench

A leaderboard for multimodal models

liked a model 24 days ago

HuggingFaceTB/FineMath-Llama-3B

Updated 24 days ago • 221 • 13

liked a dataset 25 days ago

HuggingFaceH4/MATH-500

Viewer • Updated Nov 15, 2024 • 500 • 13.9k • 62

liked a model 25 days ago

deepseek-ai/DeepSeek-V3

Text Generation • Updated 7 days ago • 409k • 2.87k

liked a model 29 days ago

Skywork/Skywork-o1-Open-PRM-Qwen-2.5-1.5B

Text Classification • Updated Nov 27, 2024 • 1.27k • 25

liked a Space 30 days ago

2024 AI Timeline

liked a model about 1 month ago

deepseek-ai/DeepSeek-V3-Base

Updated 7 days ago • 23.4k • 1.47k

liked a Space about 1 month ago

Jupyter Agent