digital-human
updated
One Shot, One Talk: Whole-body Talking Avatar from a Single Image
Paper
• 2412.01106
• Published • 24
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
Paper
• 2412.04448
• Published • 10
IDOL: Instant Photorealistic 3D Human Creation from a Single Image
Paper
• 2412.14963
• Published • 6
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human
Animation Models
Paper
• 2502.01061
• Published • 225
Pippo: High-Resolution Multi-View Humans from a Single Image
Paper
• 2502.07785
• Published • 10
X-Dancer: Expressive Music to Human Dance Video Generation
Paper
• 2502.17414
• Published • 14
Motion Anything: Any to Motion Generation
Paper
• 2503.06955
• Published • 35
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based
Spatiotemporal Diffusion for Audio-driven Talking Portrait
Paper
• 2503.12963
• Published • 7
ChatAnyone: Stylized Real-time Portrait Video Generation with
Hierarchical Motion Diffusion Model
Paper
• 2503.21144
• Published • 27
MoCha: Towards Movie-Grade Talking Character Synthesis
Paper
• 2503.23307
• Published • 141
AvatarArtist: Open-Domain 4D Avatarization
Paper
• 2503.19906
• Published • 8
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation
with Hybrid Guidance
Paper
• 2504.01724
• Published • 68
Audio-visual Controlled Video Diffusion with Masked Selective State
Spaces Modeling for Natural Talking Head Generation
Paper
• 2504.02542
• Published • 52
FantasyTalking: Realistic Talking Portrait Generation via Coherent
Motion Synthesis
Paper
• 2504.04842
• Published • 35
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High
Resolution
Paper
• 2505.00497
• Published • 17
MTVCrafter: 4D Motion Tokenization for Open-World Human Image Animation
Paper
• 2505.10238
• Published • 10
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video
Diffusion Transformers
Paper
• 2506.00830
• Published • 7
FantasyPortrait: Enhancing Multi-Character Portrait Animation with
Expression-Augmented Diffusion Transformers
Paper
• 2507.12956
• Published • 25
FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for
Audio-Driven Portrait Animation
Paper
• 2508.11255
• Published • 11
OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive
Simulation
Paper
• 2508.19209
• Published • 42
MIDAS: Multimodal Interactive Digital-human Synthesis via Real-time
Autoregressive Video Generation
Paper
• 2508.19320
• Published • 29
Kling-Avatar: Grounding Multimodal Instructions for Cascaded
Long-Duration Avatar Animation Synthesis
Paper
• 2509.09595
• Published • 48
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length
Paper
• 2512.04677
• Published • 175
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper
• 2512.11253
• Published • 39
KlingAvatar 2.0 Technical Report
Paper
• 2512.13313
• Published • 44
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
Paper
• 2601.00664
• Published • 57
DreamID-Omni: Unified Framework for Controllable Human-Centric Audio-Video Generation
Paper
• 2602.12160
• Published • 38