MagicDance: Realistic Human Dance Video Generation with Motions & Facial Expressions Transfer · 9 authors · submitted by akhaliq · 31 upvotes · 1 comment
HierSpeech++: Bridging the Gap between Semantic and Acoustic Representation of Speech by Hierarchical Variational Inference for Zero-shot Speech Synthesis · 4 authors · submitted by akhaliq · 30 upvotes · 1 comment
SuGaR: Surface-Aligned Gaussian Splatting for Efficient 3D Mesh Reconstruction and High-Quality Mesh Rendering · 2 authors · submitted by akhaliq · 28 upvotes · 3 comments
NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation · 3 authors · submitted by akhaliq · 26 upvotes · 3 comments
Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models · 5 authors · submitted by akhaliq · 22 upvotes · 4 comments
PhysGaussian: Physics-Integrated 3D Gaussians for Generative Dynamics · 7 authors · submitted by akhaliq · 21 upvotes · 1 comment
PF-LRM: Pose-Free Large Reconstruction Model for Joint Pose and Shape Prediction · 9 authors · submitted by akhaliq · 19 upvotes · 4 comments
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning · 9 authors · submitted by akhaliq · 14 upvotes · 1 comment