Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
87
28
338
Nathan Lambert
natolambert
Follow
gentaiscool's profile picture
NatashaN's profile picture
CallMeDaniel's profile picture
146 followers
·
31 following
https://www.natolambert.com/
natolambert
natolambert
AI & ML interests
Reinforcement learning, Ethics, Robotics, Dynamics Models
Recent Activity
updated
a collection
about 2 hours ago
Artifacts 7
liked
a model
about 2 hours ago
mobiuslabsgmbh/DeepSeek-R1-ReDistill-Llama3-8B-v1.1
updated
a collection
about 11 hours ago
Artifacts 7
View all activity
Articles
Ethics and Society Newsletter #4: Bias in Text-to-Image Models
Jun 26, 2023
•
2
Can foundation models label data like humans?
Jun 12, 2023
•
1
Creating a Coding Assistant with StarCoder
May 9, 2023
•
1
StackLLaMA: A hands-on guide to train LLaMA with RLHF
Apr 5, 2023
•
26
Red-Teaming Large Language Models
Feb 24, 2023
•
22
What Makes a Dialog Agent Useful?
Jan 24, 2023
•
1
Illustrating Reinforcement Learning from Human Feedback (RLHF)
Dec 9, 2022
•
135
Stable Diffusion with 🧨 Diffusers
Aug 22, 2022
•
43
Organizations
natolambert
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
New activity in
allenai/Llama-3.1-Tulu-3-8B
13 days ago
Adding Evaluation Results
#3 opened 29 days ago by
T145
New activity in
allenai/reward-bench
20 days ago
multilingual
2
#8 opened 27 days ago by
ehartford
New activity in
allenai/reward-bench
about 2 months ago
add more contaminated models to the list
2
#7 opened 3 months ago by
arielgera
New activity in
allenai/Llama-3.1-Tulu-3-70B
about 2 months ago
Reason behind not using special tokens in the prompt format?
2
#2 opened 2 months ago by
Doctor-Shotgun
New activity in
allenai/OLMo-2-1124-13B-Instruct-preview
about 2 months ago
What is that instruction template?
1
#1 opened 2 months ago by
SerialKicked
New activity in
allenai/Llama-3.1-Tulu-3-70B
about 2 months ago
Why do you use pass@10 to test coding perfmance...
1
#4 opened 2 months ago by
Leon-Leee
New activity in
allenai/OLMo-2-1124-13B-Instruct-preview
about 2 months ago
Has the data set been expanded?
1
#2 opened 2 months ago by
win10
New activity in
allenai/tulu-3-sft-personas-algebra
about 2 months ago
Librarian Bot: Add language metadata for dataset
#1 opened 2 months ago by
librarian-bot
New activity in
allenai/tulu-3-sft-personas-math
about 2 months ago
Add link to Tulu 3 paper
#2 opened 2 months ago by
gabrielmbmb
New activity in
allenai/llama-3.1-tulu-3-70b-preference-mixture
about 2 months ago
Librarian Bot: Add language metadata for dataset
#1 opened 2 months ago by
librarian-bot
New activity in
allenai/llama-3.1-tulu-3-8b-preference-mixture
about 2 months ago
Easy way to separate permissive samples
1
#1 opened 2 months ago by
RASMUS
New activity in
allenai/tulu-3-sft-mixture
about 2 months ago
recommend filter
1
#2 opened 2 months ago by
ehartford
NuminaMath-TIR License (Apache 2, not CC-BY-NC-4.0)
1
#3 opened 2 months ago by
rbattle
New activity in
allenai/Llama-3.1-Tulu-3-8B-RM
about 2 months ago
Adding `safetensors` variant of this model
#2 opened 2 months ago by
SFconvertbot
New activity in
allenai/Llama-3.1-Tulu-3-70B-SFT
about 2 months ago
Adding Evaluation Results
#2 opened 2 months ago by
leaderboard-pr-bot
New activity in
allenai/Llama-3.1-Tulu-3-8B-DPO
about 2 months ago
Adding `safetensors` variant of this model
#2 opened 2 months ago by
SFconvertbot
New activity in
allenai/Llama-3.1-Tulu-3-70B-DPO
about 2 months ago
Adding `safetensors` variant of this model
#3 opened 2 months ago by
SFconvertbot
New activity in
allenai/Llama-3.1-Tulu-3-70B
about 2 months ago
Spelling Error in Section 5.4 - "then" should be "than"
1
#3 opened 2 months ago by
eliuakk
New activity in
allenai/Llama-3.1-Tulu-3-8B
about 2 months ago
Feedback
1
#2 opened 2 months ago by
KeyboardMasher
New activity in
allenai/Llama-3.1-Tulu-3-8B-RM
2 months ago
Update README.md
#1 opened 2 months ago by
reach-vb
Load more