Beyond Transcription: Mechanistic Interpretability in ASR Paper • 2508.15882 • Published 12 days ago • 83
Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs Paper • 2406.11695 • Published Jun 17, 2024 • 2
Multi-module GRPO: Composing Policy Gradients and Prompt Optimization for Language Model Programs Paper • 2508.04660 • Published 27 days ago • 2
NVSpeech: An Integrated and Scalable Pipeline for Human-Like Speech Modeling with Paralinguistic Vocalizations Paper • 2508.04195 • Published 27 days ago • 1
Train Long, Think Short: Curriculum Learning for Efficient Reasoning Paper • 2508.08940 • Published 21 days ago • 23
When Punctuation Matters: A Large-Scale Comparison of Prompt Robustness Methods for LLMs Paper • 2508.11383 • Published 18 days ago • 39
GEPA: Reflective Prompt Evolution Can Outperform Reinforcement Learning Paper • 2507.19457 • Published Jul 25 • 25
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published Jul 17 • 248
Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment Paper • 2505.18600 • Published May 24 • 48
R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing Paper • 2505.21600 • Published May 27 • 71
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published Mar 31 • 302
Large Language Model Agent: A Survey on Methodology, Applications and Challenges Paper • 2503.21460 • Published Mar 27 • 79
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling Paper • 2501.16975 • Published Jan 28 • 32
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper • 2501.17161 • Published Jan 28 • 123
Early External Safety Testing of OpenAI's o3-mini: Insights from the Pre-Deployment Evaluation Paper • 2501.17749 • Published Jan 29 • 14
Deliberation in Latent Space via Differentiable Cache Augmentation Paper • 2412.17747 • Published Dec 23, 2024 • 33