Model refuses to load server-side with Transformers.js and struggles hard in the browser
#24
by
shoebill-droyd
- opened
It's a pretty decent model considering it's only 1.7B parameters, but I can't get it working in Node.js for the life of me, and it has huge load times in the browser. Why go through the bother of marking it as Transformers.js-compatible in this broken state? It just seems misleading to me.
Hi there :) You can use a lower quantization (e.g., q4f16 or q4) depending on your needs; q4f16 runs at ~80 tokens per second in-browser on an M4 Pro Max.
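For reference, here's a minimal in-browser sketch of loading with a lower quantization via the `dtype` option (the model id is a placeholder, substitute this repo's actual id; WebGPU availability depends on the browser):

```javascript
import { pipeline } from "@huggingface/transformers";

// Placeholder model id — replace with this repo's id.
const generator = await pipeline("text-generation", "onnx-community/model-id", {
  dtype: "q4f16",   // lower quantization: smaller download, faster load
  device: "webgpu", // falls back to WASM if WebGPU is unavailable
});

const output = await generator("Hello,", { max_new_tokens: 32 });
console.log(output[0].generated_text);
```

Lower-precision quantizations trade some output quality for a much smaller download and faster inference, which is usually the right trade-off in the browser.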
For Node.js, Transformers.js v3.4 added support for external data files, which means you can now load the larger models if you'd like.
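A Node.js sketch under those assumptions (Transformers.js >= 3.4 installed as `@huggingface/transformers`; the model id is again a placeholder — with external-data support, larger ONNX weights split across multiple files load transparently):

```javascript
// Run with Node.js as an ES module (e.g., save as generate.mjs).
import { pipeline } from "@huggingface/transformers";

// Placeholder model id — replace with this repo's id.
const generator = await pipeline("text-generation", "onnx-community/model-id", {
  dtype: "fp16", // or "q4" / "q4f16" to reduce memory usage
});

const out = await generator("Write a haiku about autumn.", {
  max_new_tokens: 64,
});
console.log(out[0].generated_text);
```

The first call downloads and caches the weights, so expect the initial run to take a while; subsequent runs load from the local cache.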