Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Om AI Lab

company
https://github.com/om-ai-lab
OmAI_lab
om-ai-lab
Activity Feed

AI & ML interests

Multimodal AI, Agents

Recent Activity

P3ngLiuΒ  updated a model 2 days ago
omlab/VLM-FO1_Qwen2.5-VL-3B-v01
P3ngLiuΒ  updated a collection 5 days ago
VLM-FO1-Models
P3ngLiuΒ  published a model 5 days ago
omlab/VLM-FO1_Qwen2.5-VL-3B-v01
View all activity

Papers

VLM-FO1: Bridging the Gap Between High-Level Reasoning and Fine-Grained Perception in VLMs

View all Papers

Articles

Trials, Errors, and Breakthroughs: Our Rocky Road to OVD SOTA with Reinforcement Learning

Mar 25
β€’ 2

ImprovingΒ ObjectΒ DetectionΒ throughΒ ReinforcementΒ LearningΒ withΒ VLM-R1

Mar 25
β€’ 3

Tony Zhao's profile picture Zilun's profile picture Peng Liu's profile picture

omlab 's Spaces 5

pinned
Running
17

Open Agent Leaderboard

πŸ₯‡

Open Agent Leaderboard

Jun 26
Runtime error
72

VLM R1 Referral Expression

πŸ’¬

Mark regions in images based on text descriptions

Apr 18
Sleeping
1

OmAgent

πŸ’¬

Process and answer questions about webpage videos

Mar 26
Runtime error
19

VLM R1 OVD

πŸ‘

VLM-R1 model for Open-Vocabulary Object Detection

Mar 21
Running

README

πŸ†

Jan 24
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs