new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Mar 11

Submitted by

akhaliq

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

·
671 authors

5

Submitted by

akhaliq

DeepSeek-VL: Towards Real-World Vision-Language Understanding

·
14 authors

4

Submitted by

akhaliq

ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment

·
6 authors

Submitted by

akhaliq

Personalized Audiobook Recommendations at Spotify Through Graph Neural Networks

·
14 authors

Submitted by

akhaliq

CogView3: Finer and Faster Text-to-Image Generation via Relay Diffusion

·
9 authors

Submitted by

akhaliq

CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model

·
9 authors

Submitted by

akhaliq

VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion Models

·
8 authors