Running 2.24k 2.24k The Ultra-Scale Playbook π The ultimate guide to training LLM on large GPU Clusters
HF Deep RL Course Collection Models cooked in HF Deep RL Course (https://huggingface.co/learn/deep-rl-course) β’ 1 item β’ Updated Feb 7
Running 534 534 Scaling test-time compute π Enhance math problem solving by scaling test-time compute
Running on CPU Upgrade 67 67 AIR-Bench Leaderboard π₯ Explore benchmark results for QA and long doc models
Running 114 114 Open-LLM performances are plateauing, letβs make the leaderboard steep again π Update leaderboard for fair model evaluation