The N Implementation Details of RLHF with PPO By vwxyzjn and 2 others • Oct 24, 2023 • 67
Welcome GPT OSS, the new open-source model family from OpenAI! By reach-vb and 11 others • 28 days ago • 481
Illustrating Reinforcement Learning from Human Feedback (RLHF) By natolambert and 3 others • Dec 9, 2022 • 332
SmolLM3: smol, multilingual, long-context reasoner By loubnabnl and 22 others • Jul 8 • 643
Open LLM Leaderboard: DROP deep dive By clefourrier and 4 others • Dec 1, 2023 • 9
What's going on with the Open LLM Leaderboard? By clefourrier and 3 others • Jun 23, 2023 • 43