Running • 2.29k • The Ultra-Scale Playbook • The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Llama-70B Text Generation • Updated 23 days ago • 334k • 635
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • Updated 23 days ago • 1.62M • 1.28k
Running on CPU Upgrade • 35 • OpenLLM French leaderboard 🇫🇷 • Explore and compare LLM benchmarks and submit models for evaluation
nvidia/Llama-3.1-Nemotron-70B-Instruct-HF Text Generation • Updated Oct 25, 2024 • 261k • 2.03k
Running • 15 • GPU Memory Calculator LLMTraining • Calculate GPU memory consumption for LLM training