Submitted by akhaliq 27 Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models · 8 authors 1
Submitted by hyeonho-jeong-video 23 Reangle-A-Video: 4D Video Generation as Video-to-Video Translation · 3 authors 1
Submitted by GuyYariv 10 RewardSDS: Aligning Score Distillation via Reward-Weighted Sampling · 3 authors 1
Submitted by yijunyang 10 GTR: Guided Thought Reinforcement Prevents Thought Collapse in RL-based VLM Agent Training · 6 authors 1
Submitted by LihiShalmon 10 More Documents, Same Length: Isolating the Challenge of Multiple Documents in RAG · 5 authors 1
Submitted by Devy1 6 Quantizing Large Language Models for Code Generation: A Differentiated Replication · 5 authors 1
Submitted by KevinQHLin 4 VLog: Video-Language Models by Generative Retrieval of Narration Vocabulary · 2 authors 1
Submitted by SingleZombie 3 Alias-Free Latent Diffusion Models:Improving Fractional Shift Equivariance of Diffusion Latent Space · 4 authors 1
Submitted by ll-13 3 When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning · 8 authors 1
Submitted by gberta 2 BIMBA: Selective-Scan Compression for Long-Range Video Question Answering · 5 authors 1
Submitted by yrshi 2 Multimodal Language Modeling for High-Accuracy Single Cell Transcriptomics Analysis and Generation · 7 authors 1
Submitted by Robot2050 1 MoC: Mixtures of Text Chunking Learners for Retrieval-Augmented Generation System · 8 authors 2
Submitted by Sailorzzcc - Monte Carlo Diffusion for Generalizable Learning-Based RANSAC · 4 authors 1
Submitted by tkreiman - Understanding and Mitigating Distribution Shifts For Machine Learning Force Fields · 2 authors 2
Submitted by mspitzna - PhysicsGen: Can Generative Models Learn from Images to Predict Complex Physical Relations? · 3 authors 1