Running • 2.24k • The Ultra-Scale Playbook • The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B • Text Generation • Updated 18 days ago • 1.59M • 1.26k
cognitivecomputations/dolphin-2.9.2-qwen2-7b • Text Generation • Updated Jun 18, 2024 • 2.11k • 67
Running on CPU Upgrade • 140 • Open Arabic LLM Leaderboard • Track, rank and evaluate open Arabic LLMs and chatbots
Running • 869 • FineWeb: decanting the web for the finest text data at scale 🍷 • Generate high-quality web text data for LLM training
argilla/distilabel-capybara-dpo-7k-binarized • Viewer • Updated Jul 16, 2024 • 7.56k • 2.58k • 181