Implicit Reasoning in Transformers is Reasoning through Shortcuts • Paper • 2503.07604 • Published 2 days ago • 17
Unified Reward Model for Multimodal Understanding and Generation • Paper • 2503.05236 • Published 5 days ago • 100
Agent-R: Training Language Model Agents to Reflect via Iterative Self-Training • Paper • 2501.11425 • Published Jan 20 • 93
MiniMax-01: Scaling Foundation Models with Lightning Attention • Paper • 2501.08313 • Published Jan 14 • 275
Revealing the Barriers of Language Agents in Planning • Paper • 2410.12409 • Published Oct 16, 2024 • 27