Submitted by akhaliq 142 MLGym: A New Framework and Benchmark for Advancing AI Research Agents · 17 authors 2
Submitted by akhaliq 96 SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features · 14 authors 5
Submitted by akhaliq 87 SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines · 95 authors 8
Submitted by msalnikov 60 How Much Knowledge Can You Pack into a LoRA Adapter without Harming LLM? · 7 authors 7
Submitted by akhaliq 31 Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning · 10 authors 3
Submitted by basil2115 25 Discovering highly efficient low-weight quantum error-correcting codes with reinforcement learning · 2 authors 4
Submitted by tsq2000 22 LongWriter-V: Enabling Ultra-Long and High-Fidelity Generation in Vision-Language Models · 11 authors 2
Submitted by vvibt 21 S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning · 9 authors 2
Submitted by Minbyul 19 Does Time Have Its Place? Temporal Heads: Where Language Models Recall Time-specific Information · 5 authors 2
Submitted by xhyandwyy 14 PC-Agent: A Hierarchical Multi-Agent Collaboration Framework for Complex Task Automation on PC · 11 authors 3
Submitted by akhaliq 13 Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation · 11 authors 2
Submitted by arkilpatel 11 How to Get Your LLM to Generate Challenging Problems for Evaluation · 3 authors 2
Submitted by Zheyuan22 10 NAVIG: Natural Language-guided Analysis with Vision Language Models for Image Geo-localization · 4 authors 2
Submitted by akhaliq 9 RelaCtrl: Relevance-Guided Efficient Control for Diffusion Transformers · 11 authors 2
Submitted by yhshu 8 From RAG to Memory: Non-Parametric Continual Learning for Large Language Models · 5 authors 2
Submitted by akhaliq 7 AlphaMaze: Enhancing Large Language Models' Spatial Intelligence via GRPO · 2 authors 2
Submitted by vansin 6 LServe: Efficient Long-sequence LLM Serving with Unified Sparse Attention · 10 authors 2
Submitted by YuchengShi 6 Enhancing Cognition and Explainability of Multimodal Foundation Models with Self-Synthesized Data · 5 authors 3
Submitted by chtmp223 6 CLIPPER: Compression enables long-context synthetic data generation · 3 authors 2
Submitted by michiyasunaga 3 Multimodal RewardBench: Holistic Evaluation of Reward Models for Vision Language Models · 3 authors 2
Submitted by saadob12 3 How Much Do LLMs Hallucinate across Languages? On Multilingual Estimation of LLM Hallucination in the Wild · 3 authors 2
Submitted by dwright37 3 Unstructured Evidence Attribution for Long Context Query Focused Summarization · 5 authors 2
Submitted by Ziruibest 3 Geolocation with Real Human Gameplay Data: A Large-Scale Dataset and Human-Like Reasoning Framework · 9 authors 2
Submitted by nielsr 2 Generating $π$-Functional Molecules Using STGG+ with Active Learning · 5 authors 2
Submitted by danielwusg 1 Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images · 4 authors 2