Edward Beeching's picture

Edward Beeching

edbeeching

·

https://edbeeching.github.io/

edbeeching

AI & ML interests

None yet

Recent Activity

published a model about 16 hours ago

edbeeching/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

published a model about 16 hours ago

edbeeching/DeepSeek-R1-Distill-Qwen-7B-GRPO

published a model about 16 hours ago

edbeeching/Qwen2.5-1.5B-Open-R1-Distill

View all activity

Articles

How NuminaMath Won the 1st AIMO Progress Prize

Jack of All Trades, Master of Some, a Multi-Purpose Transformer Agent

Vision Language Models Explained

Constitutional AI with Open LLMs

Preference Tuning LLMs with Direct Preference Optimization Methods

Can foundation models label data like humans?

Creating a Coding Assistant with StarCoder

StackLLaMA: A hands-on guide to train LLaMA with RLHF

Fine-tuning 20B LLMs with RLHF on a 24GB consumer GPU

Train your first Decision Transformer

Introducing Decision Transformers on Hugging Face 🤗

Organizations

edbeeching's activity

published 3 models about 16 hours ago

edbeeching/DeepSeek-R1-Distill-Qwen-1.5B-GRPO

Updated about 16 hours ago

edbeeching/DeepSeek-R1-Distill-Qwen-7B-GRPO

Updated about 16 hours ago

edbeeching/Qwen2.5-1.5B-Open-R1-Distill

Updated about 16 hours ago