Submitted by akhaliq 41 Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models · 8 authors 3
Submitted by hyeonho-jeong-video 27 Reangle-A-Video: 4D Video Generation as Video-to-Video Translation · 3 authors 2
Submitted by GuyYariv 13 RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling · 3 authors 2
Submitted by yijunyang 13 GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training · 6 authors 2
Submitted by LihiShalmon 13 More Documents, Same Length: Isolating the Challenge of Multiple Documents in RAG · 5 authors 2
Submitted by nielsr 7 Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning · 6 authors 2
Submitted by KevinQHLin 6 VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary · 2 authors 2
Submitted by ll-13 6 When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning · 8 authors 3
Submitted by mspitzna 6 PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations? · 3 authors 2
Submitted by Devy1 5 Quantizing Large Language Models for Code Generation: A Differentiated Replication · 5 authors 2
Submitted by SingleZombie 4 Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space · 4 authors 2
Submitted by Robot2050 3 MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System · 8 authors 3
Submitted by yrshi 3 Multimodal Language Modeling for High-Accuracy Single Cell Transcriptomics Analysis and Generation · 7 authors 2
Submitted by gberta 2 BIMBA: Selective-Scan Compression for Long-Range Video Question Answering · 5 authors 2
Submitted by Sailorzzcc - Monte Carlo Diffusion for Generalizable Learning-Based RANSAC · 4 authors 2
Submitted by tkreiman - Understanding and Mitigating Distribution Shifts For Machine Learning Force Fields · 2 authors 3