Running 2.23k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
NousResearch/DeepHermes-3-Llama-3-8B-Preview Text Generation • Updated about 2 hours ago • 17.4k • 291
Running 533 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute