Running 3.62k The Ultra-Scale Playbook 🌌 3.62k The ultimate guide to training LLM on large GPU Clusters
view article Article What’s MXFP4? The 4-Bit Secret Powering OpenAI’s GPT‑OSS Models on Modest Hardware Aug 8, 2025 • 29
Less is More: Recursive Reasoning with Tiny Networks Paper • 2510.04871 • Published Oct 6, 2025 • 501
Quantization Robustness to Input Degradations for Object Detection Paper • 2508.19600 • Published Aug 27, 2025 • 1
view article Article KV Caching Explained: Optimizing Transformer Inference Efficiency Jan 30, 2025 • 210
Quantization Robustness to Input Degradations for Object Detection Paper • 2508.19600 • Published Aug 27, 2025 • 1 • 2
Quantization Robustness to Input Degradations for Object Detection Paper • 2508.19600 • Published Aug 27, 2025 • 1