Optimizing Large Language Model Training Using FP4 Quantization Paper β’ 2501.17116 β’ Published 2 days ago β’ 23
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper β’ 2501.12599 β’ Published 9 days ago β’ 77
Can We Generate Images with CoT? Let's Verify and Reinforce Image Generation Step by Step Paper β’ 2501.13926 β’ Published 7 days ago β’ 29
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper β’ 2501.07301 β’ Published 18 days ago β’ 89
TokenVerse: Versatile Multi-concept Personalization in Token Modulation Space Paper β’ 2501.12224 β’ Published 10 days ago β’ 46
MMVU: Measuring Expert-Level Multi-Discipline Video Understanding Paper β’ 2501.12380 β’ Published 9 days ago β’ 79
Test-Time Preference Optimization: On-the-Fly Alignment via Iterative Textual Feedback Paper β’ 2501.12895 β’ Published 9 days ago β’ 51
LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation Paper β’ 2501.05414 β’ Published 21 days ago β’ 1
VideoLLaMA 3: Frontier Multimodal Foundation Models for Image and Video Understanding Paper β’ 2501.13106 β’ Published 8 days ago β’ 75
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper β’ 2501.12326 β’ Published 9 days ago β’ 47
OmniThink: Expanding Knowledge Boundaries in Machine Writing through Thinking Paper β’ 2501.09751 β’ Published 14 days ago β’ 47
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training Paper β’ 2501.11425 β’ Published 11 days ago β’ 85
PaSa: An LLM Agent for Comprehensive Academic Paper Search Paper β’ 2501.10120 β’ Published 14 days ago β’ 41
Do generative video models learn physical principles from watching videos? Paper β’ 2501.09038 β’ Published 16 days ago β’ 31
FAST: Efficient Action Tokenization for Vision-Language-Action Models Paper β’ 2501.09747 β’ Published 14 days ago β’ 23