TurboDiffusion: Accelerating Video Diffusion Models by 100-200 Times Paper • 2512.16093 • Published 17 days ago • 90
HiStream: Efficient High-Resolution Video Generation via Redundancy-Eliminated Streaming Paper • 2512.21338 • Published 11 days ago • 20
OneStory: Coherent Multi-Shot Video Generation with Adaptive Memory Paper • 2512.07802 • Published 27 days ago • 43
Mixture of States: Routing Token-Level Dynamics for Multimodal Generation Paper • 2511.12207 • Published Nov 15, 2025 • 9
Scaling Zero-Shot Reference-to-Video Generation Paper • 2512.06905 • Published 28 days ago • 28
TUNA: Taming Unified Visual Representations for Native Unified Multimodal Models Paper • 2512.02014 • Published Dec 1, 2025 • 70
One Small Step in Latent, One Giant Leap for Pixels: Fast Latent Upscale Adapter for Your Diffusion Models Paper • 2511.10629 • Published Nov 13, 2025 • 123
Tinker: Diffusion's Gift to 3D--Multi-View Consistent Editing From Sparse Inputs without Per-Scene Optimization Paper • 2508.14811 • Published Aug 20, 2025 • 42
Gemini 2.5: Pushing the Frontier with Advanced Reasoning, Multimodality, Long Context, and Next Generation Agentic Capabilities Paper • 2507.06261 • Published Jul 7, 2025 • 64
Sekai: A Video Dataset towards World Exploration Paper • 2506.15675 • Published Jun 18, 2025 • 65
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published Jun 16, 2025 • 273
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning Paper • 2506.01713 • Published Jun 2, 2025 • 48
Beyond Distillation: Pushing the Limits of Medical LLM Reasoning with Minimalist Rule-Based RL Paper • 2505.17952 • Published May 23, 2025 • 20
Cosmos Collection ⚠️ This collection is archived. https://huggingface.co/collections/nvidia/cosmos-predict25 • 31 items • Updated 12 days ago • 299