digital-human
updated
One Shot, One Talk: Whole-body Talking Avatar from a Single Image
Paper
• 2412.01106
• Published
• 24
MEMO: Memory-Guided Diffusion for Expressive Talking Video Generation
Paper
• 2412.04448
• Published
• 10
IDOL: Instant Photorealistic 3D Human Creation from a Single Image
Paper
• 2412.14963
• Published
• 6
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human
Animation Models
Paper
• 2502.01061
• Published
• 223
Pippo: High-Resolution Multi-View Humans from a Single Image
Paper
• 2502.07785
• Published
• 10
X-Dancer: Expressive Music to Human Dance Video Generation
Paper
• 2502.17414
• Published
• 14
Motion Anything: Any to Motion Generation
Paper
• 2503.06955
• Published
• 35
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based
Spatiotemporal Diffusion for Audio-driven Talking Portrait
Paper
• 2503.12963
• Published
• 7
ChatAnyone: Stylized Real-time Portrait Video Generation with
Hierarchical Motion Diffusion Model
Paper
• 2503.21144
• Published
• 27
MoCha: Towards Movie-Grade Talking Character Synthesis
Paper
• 2503.23307
• Published
• 139
AvatarArtist: Open-Domain 4D Avatarization
Paper
• 2503.19906
• Published
• 8
DreamActor-M1: Holistic, Expressive and Robust Human Image Animation
with Hybrid Guidance
Paper
• 2504.01724
• Published
• 68
Audio-visual Controlled Video Diffusion with Masked Selective State
Spaces Modeling for Natural Talking Head Generation
Paper
• 2504.02542
• Published
• 51
FantasyTalking: Realistic Talking Portrait Generation via Coherent
Motion Synthesis
Paper
• 2504.04842
• Published
• 35
KeySync: A Robust Approach for Leakage-free Lip Synchronization in High
Resolution
Paper
• 2505.00497
• Published
• 17
MTVCrafter: 4D Motion Tokenization for Open-World Human Image Animation
Paper
• 2505.10238
• Published
• 10
SkyReels-Audio: Omni Audio-Conditioned Talking Portraits in Video
Diffusion Transformers
Paper
• 2506.00830
• Published
• 7
FantasyPortrait: Enhancing Multi-Character Portrait Animation with
Expression-Augmented Diffusion Transformers
Paper
• 2507.12956
• Published
• 25
FantasyTalking2: Timestep-Layer Adaptive Preference Optimization for
Audio-Driven Portrait Animation
Paper
• 2508.11255
• Published
• 11
OmniHuman-1.5: Instilling an Active Mind in Avatars via Cognitive
Simulation
Paper
• 2508.19209
• Published
• 42
MIDAS: Multimodal Interactive Digital-human Synthesis via Real-time
Autoregressive Video Generation
Paper
• 2508.19320
• Published
• 29
Kling-Avatar: Grounding Multimodal Instructions for Cascaded
Long-Duration Avatar Animation Synthesis
Paper
• 2509.09595
• Published
• 48
Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length
Paper
• 2512.04677
• Published
• 171
PersonaLive! Expressive Portrait Image Animation for Live Streaming
Paper
• 2512.11253
• Published
• 36
KlingAvatar 2.0 Technical Report
Paper
• 2512.13313
• Published
• 43
Avatar Forcing: Real-Time Interactive Head Avatar Generation for Natural Conversation
Paper
• 2601.00664
• Published
• 56