Tulu 3 Models Collection All models released with Tulu 3 -- state of the art open post-training recipes. • 10 items • Updated 1 day ago • 55
Atla Selene Mini: A General Purpose Evaluation Model Paper • 2501.17195 • Published 3 days ago • 24 • 4
Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation Paper • 2501.17749 • Published 1 day ago • 8 • 2
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models Paper • 2501.16937 • Published 3 days ago • 3 • 2
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 2 days ago • 48 • 4
Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published 2 days ago • 23 • 2
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling Paper • 2501.16975 • Published 2 days ago • 15 • 4
Low-Rank Adapters Meet Neural Architecture Search for LLM Compression Paper • 2501.16372 • Published 8 days ago • 5 • 2
iFormer: Integrating ConvNet and Transformer for Mobile Application Paper • 2501.15369 • Published 5 days ago • 9 • 2
Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity Paper • 2501.16295 • Published 3 days ago • 5 • 1
Towards General-Purpose Model-Free Reinforcement Learning Paper • 2501.16142 • Published 3 days ago • 20 • 3