OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory Paper β’ 2512.07802 β’ Published 17 days ago β’ 43
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory Paper β’ 2512.07802 β’ Published 17 days ago β’ 43
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation Paper β’ 2511.12207 β’ Published Nov 15 β’ 8
Scaling Zero-Shot Reference-to-Video Generation Paper β’ 2512.06905 β’ Published 18 days ago β’ 28
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper β’ 2512.02014 β’ Published 24 days ago β’ 66
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation Paper β’ 2511.12207 β’ Published Nov 15 β’ 8
Scaling Zero-Shot Reference-to-Video Generation Paper β’ 2512.06905 β’ Published 18 days ago β’ 28 β’ 4
Scaling Zero-Shot Reference-to-Video Generation Paper β’ 2512.06905 β’ Published 18 days ago β’ 28
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper β’ 2512.02014 β’ Published 24 days ago β’ 66
Running on Zero MCP Featured 1.67k Qwen Image Edit Camera Control π¬ 1.67k Fast 4 step inference with Qwen Image Edit 2509
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models Paper β’ 2511.10629 β’ Published Nov 13 β’ 122
Running on Zero MCP Featured 2.57k Wan2.2 14B Fast π₯ 2.57k generate a video from an image with a text prompt