
FastVLM

updated 7 days ago

Efficient Vision Encoding for Vision Language Models

Upvotes: 59

  • apple/FastVLM-0.5B

    Text Generation • 0.8B params • Updated 3 days ago • 3.11k downloads • 118 likes

  • apple/FastVLM-1.5B

    Text Generation • 2B params • Updated 3 days ago • 786 downloads • 21 likes

  • apple/FastVLM-7B

    Text Generation • 8B params • Updated 3 days ago • 2.26k downloads • 101 likes

  • apple/FastVLM-0.5B-fp16

    0.6B params • Updated 7 days ago • 20 downloads • 5 likes

    Note: MLX checkpoint

  • apple/FastVLM-1.5B-int8

    0.5B params • Updated 7 days ago • 22 downloads • 5 likes

    Note: MLX checkpoint

  • apple/FastVLM-7B-int4

    1B params • Updated 7 days ago • 44 downloads • 9 likes

    Note: MLX checkpoint
