Boosting Latent Diffusion Models via Disentangled Representation Alignment Paper • 2601.05823 • Published 21 days ago • 17
Boosting Latent Diffusion Models via Disentangled Representation Alignment Paper • 2601.05823 • Published 21 days ago • 17
Is Nano Banana Pro a Low-Level Vision All-Rounder? A Comprehensive Evaluation on 14 Tasks and 40 Datasets Paper • 2512.15110 • Published Dec 17, 2025 • 10
VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction Paper • 2511.23386 • Published Nov 28, 2025 • 16
VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction Paper • 2511.23386 • Published Nov 28, 2025 • 16
VQRAE: Representation Quantization Autoencoders for Multimodal Understanding, Generation and Reconstruction Paper • 2511.23386 • Published Nov 28, 2025 • 16
A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space Paper • 2511.10555 • Published Nov 13, 2025 • 62
A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space Paper • 2511.10555 • Published Nov 13, 2025 • 62
A Style is Worth One Code: Unlocking Code-to-Style Image Generation with Discrete Style Space Paper • 2511.10555 • Published Nov 13, 2025 • 62 • 2
Sample By Step, Optimize By Chunk: Chunk-Level GRPO For Text-to-Image Generation Paper • 2510.21583 • Published Oct 24, 2025 • 31 • 3