-
Anim-Director: A Large Multimodal Model Powered Agent for Controllable Animation Video Generation
Paper • 2408.09787 • Published • 8 -
Factorized-Dreamer: Training A High-Quality Video Generator with Limited and Low-Quality Data
Paper • 2408.10119 • Published • 17 -
ControlNeXt: Powerful and Efficient Control for Image and Video Generation
Paper • 2408.06070 • Published • 53 -
Tora: Trajectory-oriented Diffusion Transformer for Video Generation
Paper • 2407.21705 • Published • 27
Collections
Discover the best community collections!
Collections including paper arxiv:2408.16767
-
ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model
Paper • 2408.16767 • Published • 31 -
DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation
Paper • 2411.16657 • Published • 19 -
Autoregressive Video Generation without Vector Quantization
Paper • 2412.14169 • Published • 14 -
Progressive Multimodal Reasoning via Active Retrieval
Paper • 2412.14835 • Published • 73
-
Controllable Text Generation for Large Language Models: A Survey
Paper • 2408.12599 • Published • 65 -
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations
Paper • 2408.12590 • Published • 36 -
Real-Time Video Generation with Pyramid Attention Broadcast
Paper • 2408.12588 • Published • 16 -
Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model
Paper • 2408.11039 • Published • 59
-
CompGS: Efficient 3D Scene Representation via Compressed Gaussian Splatting
Paper • 2404.09458 • Published • 7 -
FLoD: Integrating Flexible Level of Detail into 3D Gaussian Splatting for Customizable Rendering
Paper • 2408.12894 • Published • 6 -
Towards Realistic Example-based Modeling via 3D Gaussian Stitching
Paper • 2408.15708 • Published • 8 -
3D Reconstruction with Spatial Memory
Paper • 2408.16061 • Published • 15
-
TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion
Paper • 2401.09416 • Published • 11 -
SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild
Paper • 2401.10171 • Published • 14 -
DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model
Paper • 2311.09217 • Published • 22 -
GALA: Generating Animatable Layered Assets from a Single Scan
Paper • 2401.12979 • Published • 9