Unified Multimodal Model Collection A curated list for Multimodal Model Generation papers. • 18 items • Updated 28 days ago • 4
Wan-Move: Motion-controllable Video Generation via Latent Trajectory Guidance Paper • 2512.08765 • Published 16 days ago • 125
Z-Image: An Efficient Image Generation Foundation Model with Single-Stream Diffusion Transformer Paper • 2511.22699 • Published 28 days ago • 212
MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues Paper • 2512.03046 • Published 23 days ago • 11
Style Customization of Text-to-Vector Generation with Image Diffusion Priors Paper • 2505.10558 • Published May 15 • 16
AniGAN: Style-Guided Generative Adversarial Networks for Unsupervised Anime Face Generation Paper • 2102.12593 • Published Feb 24, 2021 • 1
A Large-scale Dataset for Robust Complex Anime Scene Text Detection Paper • 2510.07951 • Published Oct 9 • 6
Emu3.5 Collection Native Multimodal Models are World Learners 🌍 • 4 items • Updated about 10 hours ago • 72
Inpaint4Drag: Repurposing Inpainting Models for Drag-Based Image Editing via Bidirectional Warping Paper • 2509.04582 • Published Sep 4 • 7
Speed Up Model Collection A curated list of speed up model in multimodal generation. • 18 items • Updated Nov 12 • 1
USO: Unified Style and Subject-Driven Generation via Disentangled and Reward Learning Paper • 2508.18966 • Published Aug 26 • 56
ToonComposer: Streamlining Cartoon Production with Generative Post-Keyframing Paper • 2508.10881 • Published Aug 14 • 52
Skywork-UniPic2 Collection A Unified DiT Multimodal Model for Image Generation, Editing, and Understanding • 8 items • Updated Aug 22 • 10
NoHumansRequired: Autonomous High-Quality Image Editing Triplet Mining Paper • 2507.14119 • Published Jul 18 • 58
FLUX.1 Kontext-dev Collection A curated list of relative models of FLUX.1 Kontext-dev • 45 items • Updated 28 days ago • 6
Portrait Stylization Collection A curated list for portrait stylization. • 20 items • Updated Jun 28 • 1