Tony Zhao
tianchez
AI & ML interests
Multimodal Agent, Generative AI
Recent Activity
upvoted a collection 8 days ago: VLM-R1-models
new activity 12 days ago: omlab/VLM-R1-Referral-Expression: Apply for community grant: Personal project (gpu)
replied to their post 18 days ago
Introducing VLM-R1!
GRPO helped DeepSeek R1 learn to reason. Can it also help VLMs perform better on general computer vision tasks?
The answer is YES, and it generalizes better than SFT. We trained Qwen 2.5 VL 3B on RefCOCO (a visual grounding task) and evaluated it on RefCOCO Val and RefGTA (an out-of-distribution, OOD, task).
https://github.com/om-ai-lab/VLM-R1
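For readers unfamiliar with GRPO, the core idea can be sketched in a few lines: sample several answers for the same prompt, score each one, and normalize the scores within that group to get per-sample advantages. The sketch below is illustrative only and is not taken from the VLM-R1 repository. It assumes a box-overlap (IoU) reward for the grounding task, and the function names `iou` and `group_relative_advantages` are hypothetical.

```python
from statistics import mean, pstdev


def iou(box_a, box_b):
    """Intersection over union of two (x1, y1, x2, y2) boxes.

    Assumption: used here as a stand-in reward for visual grounding.
    """
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b
    # Width and height of the overlap region, clamped at zero.
    inter_w = max(0.0, min(ax2, bx2) - max(ax1, bx1))
    inter_h = max(0.0, min(ay2, by2) - max(ay1, by1))
    inter = inter_w * inter_h
    union = (ax2 - ax1) * (ay2 - ay1) + (bx2 - bx1) * (by2 - by1) - inter
    return inter / union if union > 0 else 0.0


def group_relative_advantages(rewards, eps=1e-6):
    """Normalize rewards within one sampled group: (r - mean) / (std + eps).

    Population standard deviation is an arbitrary choice for this sketch.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]


# One prompt, one ground-truth box, three sampled predictions (made-up numbers).
target = (10, 10, 50, 50)
predictions = [(10, 10, 50, 50), (20, 20, 60, 60), (100, 100, 120, 120)]
rewards = [iou(p, target) for p in predictions]
advantages = group_relative_advantages(rewards)
```

Predictions that overlap the target more than the group average get a positive advantage and the rest get a negative one, so no separate value model is needed to provide a baseline.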
Organizations
tianchez's activity
Apply for community grant: Personal project (gpu)
#3 opened 13 days ago by tianchez

Fixes 500 error for some users
1
#1 opened 22 days ago by Tonic

Update to correct ref: omlab/omdet-turbo-swin-tiny-hf
1
#2 opened 6 months ago by ozdeadman

Image guided object detection
1
#3 opened 5 months ago by godaspeg

Is there any open-source repo for this?
3
#1 opened 7 months ago by lucasjin