Zesen Cheng

ClownRat

AI & ML interests

Multi-modal foundation models; segmentation, detection, and tracking

Recent Activity

upvoted an article 1 day ago

Mixture of Experts Explained

upvoted an article 2 days ago

SigLIP 2: A better multilingual vision language encoder

upvoted a paper 4 days ago

Qwen2.5-VL Technical Report

Collections 1

Papers 13

arxiv:2502.13923

arxiv:2501.13106

arxiv:2501.00599

arxiv:2411.08147

Models 5

ClownRat/VideoLLaMA2.1-7B-16F

Text Generation • Updated Jan 6 • 10

ClownRat/resnet-50-torchvision

Updated Dec 25, 2024 • 13

ClownRat/mask2former-resnet-50-coco-instance

Updated Dec 25, 2024 • 74

ClownRat/resnet-101-torchvision

Updated Dec 23, 2024 • 9

ClownRat/mask2former-resnet-101-coco-instance

Updated Dec 17, 2024 • 50

Datasets 2

ClownRat/YoutubeVIS-2019

Updated 29 days ago • 38

ClownRat/COCO2017-Instance

Viewer • Updated Dec 11, 2024 • 123k • 85 • 1