LiuZhiHao

ZhiHao9806

AI & ML interests

None yet

Recent Activity

upvoted a paper 14 days ago

VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction

upvoted a paper 14 days ago

Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation

upvoted a paper 14 days ago

Disentangled 3D Scene Generation with Layout Learning

Organizations

None yet

ZhiHao9806's activity

upvoted 20 papers 14 days ago

VastGaussian: Vast 3D Gaussians for Large Scene Reconstruction

Paper • 2402.17427 • Published Feb 27, 2024 • 11

Playground v2.5: Three Insights towards Enhancing Aesthetic Quality in Text-to-Image Generation

Paper • 2402.17245 • Published Feb 27, 2024 • 12

Disentangled 3D Scene Generation with Layout Learning

Paper • 2402.16936 • Published Feb 26, 2024 • 12

Seeing and Hearing: Open-domain Visual-Audio Generation with Diffusion Latent Aligners

Paper • 2402.17723 • Published Feb 27, 2024 • 16

Sora Generates Videos with Stunning Geometrical Consistency

Paper • 2402.17403 • Published Feb 27, 2024 • 18

Towards Optimal Learning of Language Models

Paper • 2402.17759 • Published Feb 27, 2024 • 18

Training-Free Long-Context Scaling of Large Language Models

Paper • 2402.17463 • Published Feb 27, 2024 • 23

Evaluating Very Long-Term Conversational Memory of LLM Agents

Paper • 2402.17753 • Published Feb 27, 2024 • 20

Video as the New Language for Real-World Decision Making

Paper • 2402.17139 • Published Feb 27, 2024 • 20

DiffuseKronA: A Parameter Efficient Fine-tuning Method for Personalized Diffusion Model

Paper • 2402.17412 • Published Feb 27, 2024 • 23

OmniACT: A Dataset and Benchmark for Enabling Multimodal Generalist Autonomous Agents for Desktop and Web

Paper • 2402.17553 • Published Feb 27, 2024 • 24

When Scaling Meets LLM Finetuning: The Effect of Data, Model and Finetuning Method

Paper • 2402.17193 • Published Feb 27, 2024 • 25

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27, 2024 • 610

EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions

Paper • 2402.17485 • Published Feb 27, 2024 • 191

Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models

Paper • 2402.17177 • Published Feb 27, 2024 • 88

Rainbow Teaming: Open-Ended Generation of Diverse Adversarial Prompts

Paper • 2402.16822 • Published Feb 26, 2024 • 18

Towards Open-ended Visual Quality Comparison

Paper • 2402.16641 • Published Feb 26, 2024 • 19

ChatMusician: Understanding and Generating Music Intrinsically with LLM

Paper • 2402.16153 • Published Feb 25, 2024 • 60

Nemotron-4 15B Technical Report

Paper • 2402.16819 • Published Feb 26, 2024 • 45

FuseChat: Knowledge Fusion of Chat Models

Paper • 2402.16107 • Published Feb 25, 2024 • 40