SANA-1.5 Collection SANA-1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer ā¢ 6 items ā¢ Updated 2 days ago ā¢ 1
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper ā¢ 2502.02737 ā¢ Published Feb 4 ā¢ 208
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond Paper ā¢ 2503.10460 ā¢ Published 8 days ago ā¢ 25
EuroBERT Collection Scaling Multilingual Encoders for European Languages ā¢ 4 items ā¢ Updated 11 days ago ā¢ 8
Llamba: Scaling Distilled Recurrent Models for Efficient Language Processing Paper ā¢ 2502.14458 ā¢ Published 29 days ago ā¢ 2
SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers Paper ā¢ 2502.20545 ā¢ Published 22 days ago ā¢ 20
Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads Paper ā¢ 2401.10774 ā¢ Published Jan 19, 2024 ā¢ 55
Hydra: Sequentially-Dependent Draft Heads for Medusa Decoding Paper ā¢ 2402.05109 ā¢ Published Feb 7, 2024 ā¢ 1
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper ā¢ 2501.12599 ā¢ Published Jan 22 ā¢ 105