Daily Papers

by AK and the research community

Mar 22

Submitted by akhaliq

MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?

11 authors

Submitted by akhaliq

DreamReward: Text-to-3D Generation with Human Preference

8 authors

Submitted by akhaliq

Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference

6 authors

Submitted by akhaliq

AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks

5 authors

Submitted by akhaliq

ReNoise: Real Image Inversion Through Iterative Noising

5 authors

Submitted by akhaliq

Efficient Video Diffusion Models via Content-Frame Motion-Latent Decomposition

6 authors

Submitted by akhaliq

MyVLM: Personalizing VLMs for User-Specific Queries

5 authors

Submitted by akhaliq

GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation

8 authors

Submitted by akhaliq

Gaussian Frosting: Editable Complex Radiance Fields with Real-Time Rendering

2 authors

Submitted by akhaliq

Explorative Inbetweening of Time and Space

7 authors

Submitted by akhaliq

StyleCineGAN: Landscape Cinemagraph Generation using a Pre-trained StyleGAN

4 authors

Submitted by akhaliq

Recourse for reclamation: Chatting with generative language models

4 authors