Edward Beeching
edbeeching
129 followers · 28 following
https://edbeeching.github.io/
AI & ML interests
None yet
Recent Activity
published a model about 16 hours ago: edbeeching/DeepSeek-R1-Distill-Qwen-1.5B-GRPO
published a model about 16 hours ago: edbeeching/DeepSeek-R1-Distill-Qwen-7B-GRPO
published a model about 16 hours ago: edbeeching/Qwen2.5-1.5B-Open-R1-Distill
Articles
How NuminaMath Won the 1st AIMO Progress Prize (Jul 11, 2024) • 111
Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent (Apr 22, 2024) • 80
Vision Language Models Explained (Apr 11, 2024) • 248
Constitutional AI with Open LLMs (Feb 1, 2024) • 13
Preference Tuning LLMs with Direct Preference Optimization Methods (Jan 18, 2024) • 43
Can foundation models label data like humans? (Jun 12, 2023) • 1
Creating a Coding Assistant with StarCoder (May 9, 2023) • 1
StackLLaMA: A hands-on guide to train LLaMA with RLHF (Apr 5, 2023) • 26
Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU (Mar 9, 2023) • 37
Train your first Decision Transformer (Sep 8, 2022) • 4
Introducing Decision Transformers on Hugging Face 🤗 (Mar 28, 2022) • 4
Organizations
edbeeching's activity
published 3 models about 16 hours ago
edbeeching/DeepSeek-R1-Distill-Qwen-1.5B-GRPO (updated about 16 hours ago)
edbeeching/DeepSeek-R1-Distill-Qwen-7B-GRPO (updated about 16 hours ago)
edbeeching/Qwen2.5-1.5B-Open-R1-Distill (updated about 16 hours ago)