- OmnimatteRF: Robust Omnimatte with 3D Background Modeling
  Paper • 2309.07749 • Published • 8
- AudioSR: Versatile Audio Super-resolution at Scale
  Paper • 2309.07314 • Published • 27
- Generative Image Dynamics
  Paper • 2309.07906 • Published • 53
- MagiCapture: High-Resolution Multi-Concept Portrait Customization
  Paper • 2309.06895 • Published • 27
Collections
Collections including paper arxiv:2310.11448
- InstaFlow: One Step is Enough for High-Quality Diffusion-Based Text-to-Image Generation
  Paper • 2309.06380 • Published • 32
- PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models
  Paper • 2309.05793 • Published • 50
- Generative Image Dynamics
  Paper • 2309.07906 • Published • 53
- Show-1: Marrying Pixel and Latent Diffusion Models for Text-to-Video Generation
  Paper • 2309.15818 • Published • 18
- Large-Scale Automatic Audiobook Creation
  Paper • 2309.03926 • Published • 54
- Pop2Piano Demo
  🎹 Convert pop audio to piano cover • 113
- khanhld/wav2vec2-base-vietnamese-160h
  Automatic Speech Recognition • Updated • 538 • 7
- 4K4D: Real-Time 4D View Synthesis at 4K Resolution
  Paper • 2310.11448 • Published • 39