Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published 6 days ago • 25
Can Community Notes Replace Professional Fact-Checkers? Paper • 2502.14132 • Published 6 days ago • 5
SWE-Lancer: Can Frontier LLMs Earn $1 Million from Real-World Freelance Software Engineering? Paper • 2502.12115 • Published 8 days ago • 41
Explorer: Scaling Exploration-driven Web Trajectory Synthesis for Multimodal Web Agents Paper • 2502.11357 • Published 8 days ago • 9
System Message Generation for User Preferences using Open-Source Models Paper • 2502.11330 • Published 8 days ago • 15
OctoTools: An Agentic Framework with Extensible Tools for Complex Reasoning Paper • 2502.11271 • Published 9 days ago • 13