
microsoft/xclip-base-patch16
Video Classification • Updated • 5.31k • 3
Find images matching a text query
Find similar images by uploading a photo
Find similar images from a collection
FitDiT is a high-fidelity virtual try-on model.
Segment objects in images and videos using text prompts
Transform images using neural style transfer