view article Article How NuminaMath Won the 1st AIMO Progress Prize By yfleureau and 7 others • Jul 11, 2024 • 122
view article Article SmolLM - blazingly fast and remarkably powerful By loubnabnl and 2 others • Jul 16, 2024 • 412
view article Article Introducing RWKV — An RNN with the advantages of a transformer By BlinkDL and 3 others • May 15, 2023 • 23
view article Article GaLore: Advancing Large Model Training on Consumer-grade Hardware By Titus-von-Koeller and 8 others • Mar 20, 2024 • 32
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 624
StableIdentity: Inserting Anybody into Anywhere at First Sight Paper • 2401.15975 • Published Jan 29, 2024 • 19