CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Paper • 2502.07316 • Published about 24 hours ago • 11
Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach Paper • 2502.05171 • Published 5 days ago • 51
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published 1 day ago • 86
On-device Sora: Enabling Diffusion-Based Text-to-Video Generation for Mobile Devices Paper • 2502.04363 • Published 7 days ago • 9
Can LLMs Maintain Fundamental Abilities under KV Cache Compression? Paper • 2502.01941 • Published 8 days ago • 11
The Differences Between Direct Alignment Algorithms are a Blur Paper • 2502.01237 • Published 9 days ago • 109
Reward-Guided Speculative Decoding for Efficient LLM Reasoning Paper • 2501.19324 • Published 12 days ago • 34
GuardReasoner: Towards Reasoning-based LLM Safeguards Paper • 2501.18492 • Published 13 days ago • 80
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published 15 days ago • 102
Optimizing Large Language Model Training Using FP4 Quantization Paper • 2501.17116 • Published 15 days ago • 33
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 21 days ago • 316
Hunyuan3D 2.0: Scaling Diffusion Models for High Resolution Textured 3D Assets Generation Paper • 2501.12202 • Published 22 days ago • 33
GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published 29 days ago • 61