Atla Selene Mini: A General Purpose Evaluation Model Paper • 2501.17195 • Published 3 days ago • 24 • 4
Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation Paper • 2501.17749 • Published 1 day ago • 8 • 2
TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models Paper • 2501.16937 • Published 3 days ago • 3 • 2
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 2 days ago • 48 • 4
Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published 2 days ago • 23 • 2
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling Paper • 2501.16975 • Published 2 days ago • 15 • 4
Low-Rank Adapters Meet Neural Architecture Search for LLM Compression Paper • 2501.16372 • Published 8 days ago • 5 • 2
iFormer: Integrating ConvNet and Transformer for Mobile Application Paper • 2501.15369 • Published 5 days ago • 9 • 2
Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity Paper • 2501.16295 • Published 3 days ago • 5 • 1
Towards General-Purpose Model-Free Reinforcement Learning Paper • 2501.16142 • Published 3 days ago • 20 • 3
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper • 2501.13106 • Published 8 days ago • 75 • 3
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 8 days ago • 274 • 4
FilmAgent: A Multi-Agent Framework for End-to-End Film Automation in Virtual 3D Spaces Paper • 2501.12909 • Published 8 days ago • 62 • 3