Analyze images to detect objects, points, keypoints, or text
Text-to-3D and Image-to-3D Generation
An agent that uses over 9,000 vision models from the HF Hub
Generate any application with vibe coding
Segment objects in images using prompts
Segment and caption objects in images and videos
Detect objects in images or videos
State-of-the-art Zero-shot Object Detection