Submitted by akhaliq 54 Amphion: An Open-Source Audio, Music and Speech Generation Toolkit · 13 authors 4
Submitted by akhaliq 38 ReST meets ReAct: Self-Improvement for Multi-Step Reasoning LLM Agent · 13 authors 1
Submitted by akhaliq 25 DreamTalk: When Expressive Talking Head Generation Meets Diffusion Probabilistic Models · 6 authors 2
Submitted by akhaliq 14 Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models · 8 authors 1
Submitted by akhaliq 13 Extending Context Window of Large Language Models via Semantic Compression · 7 authors 1
Submitted by akhaliq 7 Faithful Persona-based Conversational Dataset Generation with Large Language Models · 5 authors 1