Model Description

YOLOE: Real-Time Seeing Anything

Installation

pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/CLIP
pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/ml-mobileclip
pip install git+https://github.com/THU-MIG/yoloe.git#subdirectory=third_party/lvis-api
pip install git+https://github.com/THU-MIG/yoloe.git

wget https://docs-assets.developer.apple.com/ml-research/datasets/mobileclip/mobileclip_blt.pt
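
After running the commands above, a quick sanity check can confirm that the packages import and that the MobileCLIP checkpoint is in the working directory. The snippet below is a minimal sketch, not part of the YOLOE repository: the module names (`clip`, `mobileclip`, `lvis`, `ultralytics`) are assumptions about how these packages are installed, and `check_setup` is a hypothetical helper.

```python
import importlib.util
from pathlib import Path

# Assumed import names for the four pip installs above; adjust if they differ.
ASSUMED_MODULES = ("clip", "mobileclip", "lvis", "ultralytics")


def check_setup(modules=ASSUMED_MODULES, checkpoint="mobileclip_blt.pt"):
    """Return a dict mapping each requirement to True if it is present."""
    # find_spec returns None when a top-level module cannot be located.
    status = {name: importlib.util.find_spec(name) is not None for name in modules}
    # The wget command above saves the checkpoint into the current directory.
    status[checkpoint] = Path(checkpoint).is_file()
    return status


if __name__ == "__main__":
    for item, ok in check_setup().items():
        print(f"{item}: {'ok' if ok else 'MISSING'}")
```

Any entry reported as MISSING points to the corresponding install or download step that should be repeated.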

Prediction

Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#prediction.
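
For orientation, text-prompted prediction might look like the sketch below. It assumes the installed package exposes a `YOLOE` class under `ultralytics` with `set_classes`, `get_text_pe`, and `predict` methods; treat the linked README as authoritative if it differs. The function names, the checkpoint path, and the image path are hypothetical placeholders.

```python
def clean_names(raw):
    """Split a comma-separated string into class names, dropping blanks and duplicates."""
    seen = []
    for part in raw.split(","):
        name = part.strip()
        if name and name not in seen:
            seen.append(name)
    return seen


def predict_with_text_prompts(checkpoint, image, names, device="cpu"):
    """Run prediction restricted to the given class names (assumed API)."""
    # Imported lazily so the helper above works without the package installed.
    from ultralytics import YOLOE  # assumption: provided by the yoloe install

    model = YOLOE(checkpoint)
    model.to(device)
    # Encode the class names as text prompt embeddings and register them.
    model.set_classes(names, model.get_text_pe(names))
    return model.predict(image, verbose=False)
```

A call such as `predict_with_text_prompts("path/to/checkpoint.pt", "path/to/image.jpg", clean_names("person, bus"))` would then return the prediction results for those two classes, provided the assumptions above hold.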

Transferring

Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#transferring.

Validation

Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#validation.

Training

Please refer to https://github.com/THU-MIG/yoloe?tab=readme-ov-file#training.

BibTeX Entry and Citation Info

@misc{wang2025yoloerealtimeseeing,
     title={YOLOE: Real-Time Seeing Anything}, 
     author={Ao Wang and Lihao Liu and Hui Chen and Zijia Lin and Jungong Han and Guiguang Ding},
     year={2025},
     eprint={2503.07465},
     archivePrefix={arXiv},
     primaryClass={cs.CV},
     url={https://arxiv.org/abs/2503.07465}, 
}

Model size: 15.4M params (Safetensors, tensor type F32)