new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

by AK and the research community

Feb 28

Submitted by

akhaliq

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

·
10 authors

Submitted by

akhaliq

EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

·
4 authors

Submitted by

akhaliq

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

·
12 authors

Submitted by

akhaliq

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

·
4 authors

Submitted by

akhaliq

OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web

·
7 authors

Submitted by

akhaliq

DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Model

·
6 authors

Submitted by

akhaliq

Training-Free Long-Context Scaling of Large Language Models

·
7 authors

Submitted by

akhaliq

Evaluating Very Long-Term Conversational Memory of LLM Agents

·
6 authors

Submitted by

akhaliq

Video as the New Language for Real-World Decision Making

·
8 authors

Submitted by

akhaliq

Towards Optimal Learning of Language Models

·
6 authors

Submitted by

akhaliq

Sora Generates Videos with Stunning Geometrical Consistency

·
6 authors

Submitted by

akhaliq

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

·
5 authors

Submitted by

akhaliq

Disentangled 3D Scene Generation with Layout Learning

·
5 authors

Submitted by

akhaliq

Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation

·
6 authors

Submitted by

akhaliq

VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction

·
11 authors