Gemini 2.0 native image generation co-doodling
Generate edited images with prompts
VGGT (CVPR 2025)
Detect objects in images or videos
Wan: Open and Advanced Large-Scale Video Generative Models
A Generalist Diffusion Model for Vision Perception
Select code snippets and launch demos for AI models
Text-to-3D and Image-to-3D Generation
Scalable and Versatile 3D Generation from images
https://huggingface.co/papers/2501.03006
Extend images using prompts and alignment options
Convert images to 3D depth maps
Generate modified images from prompts with styles
create games with AI
Quickly edit the expression of a face
Generate and edit audio from text prompts
High-fidelity Virtual Try-on
Overlay garment on person image