Classic Reinforcement Learning Collection solved classic rl environments • 2 items • Updated 22 days ago
Classic Reinforcement Learning Collection solved classic rl environments • 2 items • Updated 22 days ago
view article Article Prefill and Decode for Concurrent Requests - Optimizing LLM Performance Apr 16, 2025 • 58
Running 3.62k The Ultra-Scale Playbook 🌌 3.62k The ultimate guide to training LLM on large GPU Clusters
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper • 2508.05629 • Published Aug 7, 2025 • 180
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 627
AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs Paper • 2507.05687 • Published Jul 8, 2025 • 27
SingLoRA: Low Rank Adaptation Using a Single Matrix Paper • 2507.05566 • Published Jul 8, 2025 • 113