Collections
Discover the best community collections!
Collections including paper arxiv:2310.08579
-
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning
Paper • 2306.07967 • Published • 24 -
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Paper • 2306.07954 • Published • 112 -
TryOnDiffusion: A Tale of Two UNets
Paper • 2306.08276 • Published • 73 -
Seeing the World through Your Eyes
Paper • 2306.09348 • Published • 33
-
Matryoshka Diffusion Models
Paper • 2310.15111 • Published • 42 -
De-Diffusion Makes Text a Strong Cross-Modal Interface
Paper • 2311.00618 • Published • 23 -
MM-VID: Advancing Video Understanding with GPT-4V(ision)
Paper • 2310.19773 • Published • 20 -
SAM-CLIP: Merging Vision Foundation Models towards Semantic and Spatial Understanding
Paper • 2310.15308 • Published • 23
-
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion
Paper • 2310.08579 • Published • 17 -
MotionDirector: Motion Customization of Text-to-Video Diffusion Models
Paper • 2310.08465 • Published • 16 -
DPM-Solver-v3: Improved Diffusion ODE Solver with Empirical Model Statistics
Paper • 2310.13268 • Published • 18 -
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Paper • 2310.19512 • Published • 16
-
Prometheus: Inducing Fine-grained Evaluation Capability in Language Models
Paper • 2310.08491 • Published • 54 -
HyperHuman: Hyper-Realistic Human Generation with Latent Structural Diffusion
Paper • 2310.08579 • Published • 17 -
Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning
Paper • 2310.12921 • Published • 19 -
De-Diffusion Makes Text a Strong Cross-Modal Interface
Paper • 2311.00618 • Published • 23