ByteDance-Seed/Seed-OSS-36B-Instruct Text Generation • 36B • Updated 7 days ago • 15.3k • 381
Running 120 120 TxT360: Trillion Extracted Text 📖 Create a large-scale deduplicated text dataset for LLM training
MAGA: MAssive Genre-Audience Reformulation to Pretraining Corpus Expansion Paper • 2502.04235 • Published Feb 6 • 22
Running 3.14k 3.14k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Snowflake/snowflake-arctic-embed-m Sentence Similarity • 0.1B • Updated Dec 13, 2024 • 390k • 156