TempoMaster: Efficient Long Video Generation via Next-Frame-Rate Prediction
Paper: arXiv 2511.12578
TempoMaster is a video diffusion model built on Wan-Video that can generate videos at various frame rates.
The model first generates a low-frame-rate video as a global blueprint. It then uses the existing frames as temporal anchors to infer and insert additional frames in between, progressively upsampling the video to higher frame rates.
This coarse-to-fine approach structures long-term temporal dynamics and mitigates the visual drift caused by error accumulation.
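The coarse-to-fine schedule described above can be illustrated with a small sketch of the frame-index bookkeeping. This is not TempoMaster's code or API: the function name `frame_rate_schedule`, the `base_stride` parameter, and the choice to double the frame rate at each stage are all assumptions made for illustration. The sketch only shows which frame positions would be produced at each stage, not how the diffusion model generates them.

```python
def frame_rate_schedule(num_frames: int, base_stride: int) -> list[list[int]]:
    """Return, for each stage, the frame indices newly produced at that stage.

    Hypothetical helper: stage 0 covers every `base_stride`-th frame (the
    low-frame-rate blueprint); each later stage halves the stride and fills
    in the frames that lie between the already existing ones.
    """
    if num_frames < 1:
        raise ValueError("num_frames must be at least 1")
    if base_stride < 1 or base_stride & (base_stride - 1):
        raise ValueError("base_stride must be a positive power of two")

    stages: list[list[int]] = []
    existing: set[int] = set()
    stride = base_stride
    while stride >= 1:
        # Frames on the current grid that no earlier stage has produced.
        new_frames = [i for i in range(0, num_frames, stride) if i not in existing]
        stages.append(new_frames)
        existing.update(new_frames)
        stride //= 2  # assumed: the frame rate doubles at every stage
    return stages


if __name__ == "__main__":
    for stage, frames in enumerate(frame_rate_schedule(num_frames=9, base_stride=4)):
        print(f"stage {stage}: {frames}")
```

With 9 frames and a base stride of 4, stage 0 yields frames 0, 4, 8, stage 1 adds 2 and 6, and stage 2 adds 1, 3, 5, 7. Every frame added after stage 0 sits between two frames that already exist, which is the anchoring property the description relies on.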
Base model: Wan-AI/Wan2.2-I2V-A14B