Nathan Lambert's picture

Nathan Lambert

natolambert

·

https://www.natolambert.com/

AI & ML interests

Reinforcement learning, Ethics, Robotics, Dynamics Models

Recent Activity

updated a collection about 2 hours ago

liked a model about 2 hours ago

mobiuslabsgmbh/DeepSeek-R1-ReDistill-Llama3-8B-v1.1

updated a collection about 11 hours ago

View all activity

Articles

Ethics and Society Newsletter #4: Bias in Text-to-Image Models

Can foundation models label data like humans?

Creating a Coding Assistant with StarCoder

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Red-Teaming Large Language Models

What Makes a Dialog Agent Useful?

Illustrating Reinforcement Learning from Human Feedback (RLHF)

Stable Diffusion with 🧨 Diffusers

Organizations

natolambert's activity

New activity in allenai/Llama-3.1-Tulu-3-8B 13 days ago

Adding Evaluation Results

#3 opened 29 days ago by

New activity in allenai/reward-bench 20 days ago

multilingual

#8 opened 27 days ago by

New activity in allenai/reward-bench about 2 months ago

add more contaminated models to the list

#7 opened 3 months ago by

New activity in allenai/Llama-3.1-Tulu-3-70B about 2 months ago

Reason behind not using special tokens in the prompt format?

#2 opened 2 months ago by

New activity in allenai/OLMo-2-1124-13B-Instruct-preview about 2 months ago

What is that instruction template?

#1 opened 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-70B about 2 months ago

Why do you use pass@10 to test coding perfmance...

#4 opened 2 months ago by

New activity in allenai/OLMo-2-1124-13B-Instruct-preview about 2 months ago

Has the data set been expanded?

#2 opened 2 months ago by

New activity in allenai/tulu-3-sft-personas-algebra about 2 months ago

Librarian Bot: Add language metadata for dataset

#1 opened 2 months ago by

New activity in allenai/tulu-3-sft-personas-math about 2 months ago

Add link to Tulu 3 paper

#2 opened 2 months ago by

New activity in allenai/llama-3.1-tulu-3-70b-preference-mixture about 2 months ago

Librarian Bot: Add language metadata for dataset

#1 opened 2 months ago by

New activity in allenai/llama-3.1-tulu-3-8b-preference-mixture about 2 months ago

Easy way to separate permissive samples

#1 opened 2 months ago by

New activity in allenai/tulu-3-sft-mixture about 2 months ago

recommend filter

#2 opened 2 months ago by

NuminaMath-TIR License (Apache 2, not CC-BY-NC-4.0)

#3 opened 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-8B-RM about 2 months ago

Adding `safetensors` variant of this model

#2 opened 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-70B-SFT about 2 months ago

Adding Evaluation Results

#2 opened 2 months ago by

leaderboard-pr-bot

New activity in allenai/Llama-3.1-Tulu-3-8B-DPO about 2 months ago

Adding `safetensors` variant of this model

#2 opened 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-70B-DPO about 2 months ago

Adding `safetensors` variant of this model

#3 opened 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-70B about 2 months ago

Spelling Error in Section 5.4 - "then" should be "than"

#3 opened 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-8B about 2 months ago

Feedback

#2 opened 2 months ago by

New activity in allenai/Llama-3.1-Tulu-3-8B-RM 2 months ago

Update README.md

#1 opened 2 months ago by