Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

  • This model belongs to the family of official Lotus models.
  • Compared to the previous version, this model is trained in disparity space (inverse depth), achieving better performance and more stable video depth estimation.

Paper Paper HuggingFace Demo GitHub

Developed by: Jing He✱, Haodong Li✱, Wei Yin, Yixun Liang, Leheng Li, Kaiqiang Zhou, Hongbo Zhang, Bingbing Liu, Ying-Cong Chen✉

teaser teaser

Usage

Please refer to this page.

Downloads last month
1,059
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the HF Inference API does not support diffusers models with pipeline type depth-estimation