Submitted by Vasily 80 When Models Lie, We Learn: Multilingual Span-Level Hallucination Detection with PsiloQA AIRI - Artificial Intelligence Research Institute 5 1
Submitted by dongguanting 77 Agentic Entropy-Balanced Policy Optimization Renmin University of China 678 3
Submitted by taesiri 65 WithAnyone: Towards Controllable and ID Consistent Image Generation StepFun 102 2
Submitted by zichenwen 60 AI for Service: Proactive Assistance with AI Glasses Shanghai Jiao Tong University 1
Submitted by Paranioar 51 From Pixels to Words -- Towards Native Vision-Language Primitives at Scale SenseTime 49 1
Submitted by xiaochonglinghu 46 ImagerySearch: Adaptive Test-Time Search for Video Generation Beyond Semantic Dependency Constraints AMAP-ML 46 1
Submitted by KID-22 30 Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn LLM Agents Ant Group 1 1
Submitted by Keven16 30 LaSeR: Reinforcement Learning with Last-Token Self-Rewarding Tencent Hunyuan 5 1
Submitted by pengyunie 26 TokDrift: When LLM Speaks in Subwords but Code Speaks in Grammar University of Waterloo 4 1
Submitted by mukul54 24 Attention Is All You Need for KV Cache in Diffusion LLMs Mohamed Bin Zayed University of Artificial Intelligence 1
Submitted by taesiri 23 PaddleOCR-VL: Boosting Multilingual Document Parsing via a 0.9B Ultra-Compact Vision-Language Model PaddlePaddle 57.9k 3
Submitted by CheeryLJH 13 VR-Thinker: Boosting Video Reward Models through Thinking-with-Image Reasoning NJU-LINK Lab 14 1
Submitted by kenchan0226 13 Large Language Models Do NOT Really Know What They Don't Know Singapore Management University 1
Submitted by taesiri 12 MathCanvas: Intrinsic Visual Chain-of-Thought for Multimodal Mathematical Reasoning · 14 authors 6 1
Submitted by han1997 10 VLA^2: Empowering Vision-Language-Action Models with an Agentic Framework for Unseen Concept Manipulation Zhejiang University 1
Submitted by XINLI1997 10 COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes Multimodal Art Projection 0 1
Submitted by XINLI1997 9 Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures ByteDance Seed 0 1
Submitted by bclavie 8 Fantastic (small) Retrievers and How to Train Them: mxbai-edge-colbert-v0 Tech Report Mixedbread 1
Submitted by shenweijie 8 Expertise need not monopolize: Action-Specialized Mixture of Experts for Vision-Language-Action Learning · 13 authors 1
Submitted by MilaWang 8 LiveResearchBench: A Live Benchmark for User-Centric Deep Research in the Wild · 10 authors 1
Submitted by jyhong836 8 LLMs Can Get "Brain Rot"! Visual Informatics Group @ University of Texas at Austin 7 1
Submitted by Lakonik 5 pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation Adobe 34 1
Submitted by DaYin 5 LLMs as Scalable, General-Purpose Simulators For Evolving Digital Agent Training UCLA NLP 1
Submitted by hk 5 DialectGen: Benchmarking and Improving Dialect Robustness in Multimodal Generation UCLA NLP 3 1
Submitted by jiwonsong 5 LiteStage: Latency-aware Layer Skipping for Multi-stage Reasoning Seoul National University 0 1
Submitted by HJGO 5 VIST3A: Text-to-3D by Stitching a Multi-view Reconstruction Network to a Video Generator · 6 authors 23 1
Submitted by stefan-it 4 The German Commons - 154 Billion Tokens of Openly Licensed Text for German Language Models CORAL NLP Research 3 1
Submitted by JonasGeiping 3 Efficient Parallel Samplers for Recurrent-Depth Models and Their Connection to Diffusion Language Models ELLIS Institute Tübingen 833 1
Submitted by kylemontgomery 2 Budget-aware Test-time Scaling via Discriminative Verification · 7 authors 1 1
Submitted by Robot2050 2 MoM: Mixtures of Scenario-Aware Document Memories for Retrieval-Augmented Generation Systems · 6 authors 1
Submitted by SP2001 2 Synthesizing Agentic Data for Web Agents with Progressive Difficulty Enhancement Mechanisms · 7 authors 1
Submitted by shaoweiliu 1 Ponimator: Unfolding Interactive Pose for Versatile Human-human Interaction Animation Snapchat Inc. 5 1
Submitted by kylemontgomery 1 Predicting Task Performance with Context-aware Scaling Laws · 7 authors 1 1
Submitted by augustus2011 1 Beyond One World: Benchmarking Super Heros in Role-Playing Across Multiversal Contexts Character-lab 1 1
Submitted by awni00 1 Unlocking Out-of-Distribution Generalization in Transformers via Recursive Latent Space Reasoning · 4 authors 0 1
Submitted by zhangchen1991 1 RAGCap-Bench: Benchmarking Capabilities of LLMs in Agentic Retrieval Augmented Generation Systems National University of Singapore 1
Submitted by wimmerth 1 AnyUp: Universal Feature Upsampling Max Planck Institute for Informatics 186 1
Submitted by aashiqmuhamed 1 RefusalBench: Generative Evaluation of Selective Refusal in Grounded Language Models Amazon AGI 1
Submitted by kedaxiaoqiu 1 SCas4D: Structural Cascaded Optimization for Boosting Persistent 4D Novel View Synthesis University of Illinois at Urbana-Champaign 1
Submitted by ZYao720 - GroundedPRM: Tree-Guided and Fidelity-Aware Process Reward Modeling for Step-Level Reasoning Ludwig Maximilian University of Munich 1
Submitted by NickNickGo - Mirror Speculative Decoding: Breaking the Serial Barrier in LLM Inference Apple 1
Submitted by qiranzou - FML-bench: A Benchmark for Automatic ML Research Agents Highlighting the Importance of Exploration Breadth National University of Singapore 3 1