Sketch-of-Thought: Efficient LLM Reasoning with Adaptive Cognitive-Inspired Sketching • Paper • 2503.05179 • Published 5 days ago • 42
SafeArena: Evaluating the Safety of Autonomous Web Agents • Paper • 2503.04957 • Published 5 days ago • 17
Learning from Failures in Multi-Attempt Reinforcement Learning • Paper • 2503.04808 • Published 8 days ago • 14
LLM as a Broken Telephone: Iterative Generation Distorts Information • Paper • 2502.20258 • Published 13 days ago • 21
How to Steer LLM Latents for Hallucination Detection? • Paper • 2503.01917 • Published 11 days ago • 10
Identifying Sensitive Weights via Post-quantization Integral • Paper • 2503.01901 • Published 12 days ago • 7