Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
1168
89
547
Lewis Tunstall
PRO
lewtun
Follow
leeloolee's profile picture
shubhamIITpkd's profile picture
SohilG's profile picture
830 followers
·
69 following
https://lewtun.github.io/blog/
_lewtun
lewtun
AI & ML interests
LLMs, LLMs, LLMs
Articles
Open-R1: a fully open reproduction of DeepSeek-R1
3 days ago
•
460
Universal Assisted Generation: Faster Decoding with Any Assistant Model
Oct 29, 2024
•
52
Faster Assisted Generation with Dynamic Speculation
Oct 8, 2024
•
44
Llama can now see and run on your device - welcome Llama 3.2
Sep 25, 2024
•
182
FineVideo: behind the scenes
Sep 23, 2024
•
28
How NuminaMath Won the 1st AIMO Progress Prize
Jul 11, 2024
•
111
Welcome Gemma 2 - Google's new open LLM
Jun 27, 2024
•
126
Constitutional AI with Open LLMs
Feb 1, 2024
•
13
Preference Tuning LLMs with Direct Preference Optimization Methods
Jan 18, 2024
•
43
Mixture of Experts Explained
Dec 11, 2023
•
275
Welcome Mixtral - a SOTA Mixture of Experts on Hugging Face
Dec 11, 2023
•
11
SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit
Dec 6, 2023
•
6
Fine-tuning Llama 2 70B using PyTorch FSDP
Sep 13, 2023
•
16
Code Llama: Llama 2 learns to code
Aug 25, 2023
•
9
Llama 2 is here - get it on Hugging Face
Jul 18, 2023
•
24
Can foundation models label data like humans?
Jun 12, 2023
•
1
The Falcon has landed in the Hugging Face ecosystem
Jun 5, 2023
•
12
Creating a Coding Assistant with StarCoder
May 9, 2023
•
1
StackLLaMA: A hands-on guide to train LLaMA with RLHF
Apr 5, 2023
•
26
Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU
Mar 9, 2023
•
37
Red-Teaming Large Language Models
Feb 24, 2023
•
22
Diffusion Models Live Event
Nov 25, 2022
Very Large Language Models and How to Evaluate Them
Oct 3, 2022
•
1
SetFit: Efficient Few-Shot Learning Without Prompts
Sep 26, 2022
•
22
Announcing Evaluation on the Hub
Jun 28, 2022
Organizations
lewtun
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
liked
a model
about 9 hours ago
mistralai/Mistral-Small-24B-Instruct-2501
Text Generation
•
Updated
about 7 hours ago
•
289
liked
a model
about 10 hours ago
mistralai/Mistral-Small-24B-Base-2501
Text Generation
•
Updated
about 8 hours ago
•
120
liked
a dataset
about 12 hours ago
open-r1/OpenThoughts-114k-math
Viewer
•
Updated
about 14 hours ago
•
89.1k
•
13
•
10
liked
a dataset
about 16 hours ago
cognitivecomputations/dolphin-r1
Viewer
•
Updated
about 6 hours ago
•
814k
•
20
•
68
liked
2 datasets
2 days ago
ServiceNow-AI/R1-Distill-SFT
Viewer
•
Updated
2 days ago
•
1.85M
•
317
•
94
open-thoughts/OpenThoughts-114k
Viewer
•
Updated
1 day ago
•
114k
•
5.18k
•
148
liked
4 models
7 days ago
RLHFlow/Llama3.1-8B-PRM-Deepseek-Data
Text Generation
•
Updated
Nov 9, 2024
•
21.7k
•
33
meta-llama/Llama-3.2-1B-Instruct
Text Generation
•
Updated
Oct 24, 2024
•
1.32M
•
728
HuggingFaceTB/SmolVLM-256M-Instruct
Image-Text-to-Text
•
Updated
8 days ago
•
11.7k
•
110
HuggingFaceTB/SmolVLM-500M-Instruct
Image-Text-to-Text
•
Updated
8 days ago
•
7.72k
•
82
liked
2 models
9 days ago
deepseek-ai/DeepSeek-R1-Distill-Qwen-1.5B
Text Generation
•
Updated
5 days ago
•
225k
•
570
deepseek-ai/DeepSeek-R1
Text Generation
•
Updated
5 days ago
•
498k
•
5.31k
liked
a Space
24 days ago
Running
34
🥇
MEGA-Bench
A leaderboard for multimodal models
liked
a model
24 days ago
HuggingFaceTB/FineMath-Llama-3B
Updated
24 days ago
•
221
•
13
liked
a dataset
25 days ago
HuggingFaceH4/MATH-500
Viewer
•
Updated
Nov 15, 2024
•
500
•
13.9k
•
62
liked
a model
25 days ago
deepseek-ai/DeepSeek-V3
Text Generation
•
Updated
7 days ago
•
409k
•
2.87k
liked
a model
29 days ago
Skywork/Skywork-o1-Open-PRM-Qwen-2.5-1.5B
Text Classification
•
Updated
Nov 27, 2024
•
1.27k
•
25
liked
a Space
30 days ago
Running
429
📈
2024 AI Timeline
liked
a model
about 1 month ago
deepseek-ai/DeepSeek-V3-Base
Updated
7 days ago
•
23.4k
•
1.47k
liked
a Space
about 1 month ago
Running
240
🏃
Jupyter Agent
Load more