please create a quantized version, preferably using bitsandbytes!

by ctranslate2-4you - opened 17 days ago

Discussion

ctranslate2-4you

17 days ago

Really like the model but would like to use it with BitsandBytes...

amanrangapur

Ai2 org 17 days ago

Hey @ctranslate2-4you , check this out: https://huggingface.co/allenai/olmOCR-7B-0225-preview-GGUF

cnmoro

16 days ago

How do we perform inference on a pair of image + prompt, using gguf?

ctranslate2-4you

15 days ago

Hey @ctranslate2-4you , check this out: https://huggingface.co/allenai/olmOCR-7B-0225-preview-GGUF

Nice, but I prefer to use BNB for now. do you guys plan on making one? Otherwise, I'd have to add a bunch of llama.cpp dependencies just for this..

amanrangapur

Ai2 org 15 days ago

•

edited 15 days ago

Hey @ctranslate2-4you , check this out: https://huggingface.co/allenai/olmOCR-7B-0225-preview-GGUF

Nice, but I prefer to use BNB for now. do you guys plan on making one? Otherwise, I'd have to add a bunch of llama.cpp dependencies just for this..

Noo, not atm. If you plan to make one, can you push it to HF.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

Your need to confirm your account before you can post a new comment.

· Sign up or log in to comment