Zesen Cheng
ClownRat
AI & ML interests
Multi-modal foundation models; segmentation, detection, and tracking
Recent Activity
upvoted an article 1 day ago: Mixture of Experts Explained
upvoted an article 1 day ago: SigLIP 2: A better multilingual vision language encoder
upvoted a paper 3 days ago: Qwen2.5-VL Technical Report