new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

by AK and the research community

Nov 28

Submitted by

akhaliq

CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models

·
7 authors

Submitted by

vyokky

Large Language Model-Brained GUI Agents: A Survey

·
12 authors

Submitted by

ajhamdi

3D Convex Splatting: Radiance Field Rendering with 3D Smooth Convexes

·
9 authors

Submitted by

primecai

Diffusion Self-Distillation for Zero-Shot Customized Image Generation

·
6 authors

Submitted by

LegendBC

DiffusionDrive: Truncated Diffusion Model for End-to-End Autonomous Driving

·
11 authors

Submitted by

akhaliq

Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters

·
6 authors

Submitted by

Ema97x

DreamCache: Finetuning-Free Lightweight Personalized Image Generation via Feature Caching

·
5 authors

Submitted by

Zigeng

Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient

·
4 authors

Submitted by

Mountchicken

ChatRex: Taming Multimodal LLM for Joint Perception and Understanding

·
8 authors

Submitted by

LiyiGang

UniPose: A Unified Multimodal Framework for Human Pose Comprehension, Generation and Editing

·
5 authors

Submitted by

czyang

Video-Guided Foley Sound Generation with Multimodal Controls

·
7 authors

Submitted by

akhaliq

Omegance: A Single Parameter for Various Granularities in Diffusion-Based Synthesis

·
4 authors

Submitted by

Geralt-Targaryen

Draft Model Knows When to Stop: A Self-Verification Length Policy for Speculative Decoding

·
7 authors

Submitted by

ColorfulAI

VideoLLM Knows When to Speak: Enhancing Time-Sensitive Video Comprehension with Video-Text Duet Interaction Format

·
7 authors

Submitted by

Sarim-Hash

Optimizing Brain Tumor Segmentation with MedNeXt: BraTS 2024 SSA and Pediatrics

·
9 authors

Submitted by

davidserra9

Adaptive Blind All-in-One Image Restoration

·
4 authors

Submitted by

yifAI

Training and Evaluating Language Models with Template-based Data Generation

·
1 authors

Submitted by

luomingshuang

Morph: A Motion-free Physics Optimization Framework for Human Motion Generation

·
8 authors

Submitted by

vztu

Edit Away and My Face Will not Stay: Personal Biometric Defense against Malicious Generative Editing

·
6 authors