Quantized Olmo 3 Collection Verified models. All compatible with vLLM for very fast inference. Use the 3.1 models as they are more recent. • 23 items • Updated 9 days ago • 3
Nemotron-Cascade Collection Scaling Cascaded Reinforcement Learning for General-Purpose Reasoning Models • 17 items • Updated 1 day ago • 36
Article Apriel-H1: The Surprising Key to Distilling Efficient Reasoning Models Nov 19 • 33
Article DeepMath: A lightweight math reasoning Agent with smolagents 21 days ago • 30
Article Apriel-1.6-15b-Thinker: Cost-efficient Frontier Multimodal Performance 15 days ago • 81
Mistral Large 3 Collection A state-of-the-art, open-weight, general-purpose multimodal model with a granular Mixture-of-Experts architecture. • 4 items • Updated 23 days ago • 80
Manipulation as in Simulation: Enabling Accurate Geometry Perception in Robots Paper • 2509.02530 • Published Sep 2 • 10
DR Tulu Collection Models and data associated with DR Tulu, http://allenai-web/papers/drtulu • 5 items • Updated about 1 month ago • 31
π_RL: Online RL Fine-tuning for Flow-based Vision-Language-Action Models Paper • 2510.25889 • Published Oct 29 • 64
Unified Reinforcement and Imitation Learning for Vision-Language Models Paper • 2510.19307 • Published Oct 22 • 29
From Pixels to Words -- Towards Native Vision-Language Primitives at Scale Paper • 2510.14979 • Published Oct 16 • 66