Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Chenming Zhu's picture
1 17 3

Chenming Zhu

ChaimZhu
qq32026's profile picture sbrandeis's profile picture KevinHuang's profile picture
·
https://zcmax.github.io/
  • Eri_Chu_
  • ZCMax

AI & ML interests

Multimodal Large Language Models, 3D Perception and Understanding, Embodied AI

Recent Activity

upvoted a paper 15 days ago
MMSI-Video-Bench: A Holistic Benchmark for Video-Based Spatial Intelligence
upvoted a paper 22 days ago
Ground Slow, Move Fast: A Dual-System Foundation Model for Generalizable Vision-and-Language Navigation
upvoted a paper about 1 month ago
G^2VLM: Geometry Grounded Vision Language Model with Unified 3D Reconstruction and Spatial Reasoning
View all activity

Organizations

HKU LIU Vision Group's profile picture Intern Robotics's profile picture

authored a paper 6 months ago

OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding

Paper • 2507.07984 • Published Jul 10, 2025 • 42
authored a paper over 1 year ago

LLaVA-3D: A Simple yet Effective Pathway to Empowering LMMs with 3D-awareness

Paper • 2409.18125 • Published Sep 26, 2024 • 34
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs