NLE: Non-autoregressive LLM-based ASR by Transcript Editing Paper • 2603.08397 • Published 10 days ago • 21
MAISI-v2: Accelerated 3D High-Resolution Medical Image Synthesis with Rectified Flow and Region-specific Contrastive Loss Paper • 2508.05772 • Published Aug 7, 2025 • 3
Vad-R1: Towards Video Anomaly Reasoning via Perception-to-Cognition Chain-of-Thought Paper • 2505.19877 • Published May 26, 2025 • 1
PaperBanana: Automating Academic Illustration for AI Scientists Paper • 2601.23265 • Published Jan 30, 2026 • 219
AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning Paper • 2601.18631 • Published Jan 26, 2026 • 48
OmniTransfer: All-in-one Framework for Spatio-temporal Video Transfer Paper • 2601.14250 • Published Jan 20, 2026 • 48
Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation Paper • 2512.24271 • Published Dec 30, 2025 • 64
How to make NeuTTS-air generate over 200 seconds of audio in a single second Article • Published Nov 21, 2025 • 24
Video Reality Test: Can AI-Generated ASMR Videos fool VLMs and Humans? Paper • 2512.13281 • Published Dec 15, 2025 • 65
Graph of Verification: Structured Verification of LLM Reasoning with Directed Acyclic Graphs Paper • 2506.12509 • Published Jun 14, 2025 • 2
Scaling Zero-Shot Reference-to-Video Generation Paper • 2512.06905 • Published Dec 7, 2025 • 29
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length Paper • 2512.04677 • Published Dec 4, 2025 • 175
PaperDebugger: A Plugin-Based Multi-Agent System for In-Editor Academic Writing, Review, and Editing Paper • 2512.02589 • Published Dec 2, 2025 • 73