This is a great model
#1 opened by FrenzyBiscuit
It works great once you start using Qwenception.
Can you do a 1.5B version for speculative decoding?
Actually, it would be cool to see a 14B version as well.
Thanks! I'll probably switch gears to 1.5B first (since experiments will be cheaper), then come back to 14B. :)