Zesen Cheng
ClownRat
AI & ML interests
Multi-modal foundation models; segmentation, detection, and tracking
Recent Activity
Upvoted an article 1 day ago: Mixture of Experts Explained
Upvoted an article 2 days ago: SigLIP 2: A better multilingual vision language encoder
Upvoted a paper 4 days ago: Qwen2.5-VL Technical Report