Apply for community grant: Academic project (gpu and storage)

#1
by mucai - opened

We present Matryoshka Multimodal Models (M3), which represents visual tokens in a nested manner following the coarse-to-fine order. Now users can explicitly control the visual granularity per test instance during inference! It will be great to host this model in huggingface!
@akhaliq

teaser.png

Hi @mucai , we've assigned ZeroGPU to this Space. Please check the compatibility and usage sections of this page so your Space can run on ZeroGPU.

Huge thanks!

Sign up or log in to comment