A collection of AIMv2 vision encoders that supports a number of resolutions, native resolution, and a distilled checkpoint.
-
apple/aimv2-large-patch14-224
Image Feature Extraction β’ Updated β’ 3.2k β’ 41 -
apple/aimv2-huge-patch14-224
Image Feature Extraction β’ Updated β’ 303 β’ 7 -
apple/aimv2-1B-patch14-224
Image Feature Extraction β’ Updated β’ 116 β’ 4 -
apple/aimv2-3B-patch14-224
Image Feature Extraction β’ Updated β’ 27 β’ 2