Submitted by akhaliq 51 CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models · 7 authors 5
Submitted by ajhamdi 17 3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes · 9 authors 5
Submitted by primecai 15 Diffusion Self-Distillation for Zero-Shot Customized Image Generation · 6 authors 6
Submitted by LegendBC 15 DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving · 11 authors 2
Submitted by akhaliq 14 Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters · 6 authors 4
Submitted by Ema97x 12 DreamCache: Finetuning-Free Lightweight Personalized Image Generation via Feature Caching · 5 authors 3
Submitted by Zigeng 12 Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient · 4 authors 2
Submitted by Mountchicken 10 ChatRex: Taming Multimodal LLM for Joint Perception and Understanding · 8 authors 3
Submitted by LiyiGang 10 UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing · 5 authors 3
Submitted by akhaliq 7 Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis · 4 authors 2
Submitted by Geralt-Targaryen 6 Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding · 7 authors 2
Submitted by ColorfulAI 5 VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format · 7 authors 2
Submitted by Sarim-Hash 5 Optimizing Brain Tumor Segmentation with MedNeXt: BraTS 2024 SSA and Pediatrics · 9 authors 2
Submitted by yifAI 3 Training and Evaluating Language Models with Template-based Data Generation · 1 authors 3
Submitted by luomingshuang 2 Morph: A Motion-free Physics Optimization Framework for Human Motion Generation · 8 authors 2
Submitted by vztu 2 Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing · 6 authors 3