GGUF When?

#1
by xTimeCrystal - opened

When is the GGUF quantized version releasing?

OpenBMB org

https://huggingface.co/openbmb/MiniCPM-V-4_5-gguf
We've received your question. GGUF support has been added in that repository, and the quantized files should all be there; the page cache may simply not have refreshed yet. Please check again.

May I ask when the issue of the GGUF model in llama-server being unable to disable reasoning mode after startup will be resolved? Thank you very much.

@zhouxihong Ok, I will submit a PR to resolve it before Wednesday.

OpenBMB org

@zhouxihong I noticed that llama.cpp already includes instructions for using think mode.
You can disable think mode for the model by setting the environment variable: `export LLAMA_ARG_THINK=0`.
Hope this helps. If you have any further questions, feel free to open an issue.
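Putting the reply above together, a minimal launch sketch might look like this. The environment variable is the one named in the reply; the model filename and port are illustrative, not from this thread:

```shell
# Disable think mode via the environment variable mentioned above
export LLAMA_ARG_THINK=0

# Then start the server as usual (model filename and port are illustrative):
# llama-server -m MiniCPM-V-4_5-Q4_K_M.gguf --port 8080
```

Setting the variable in the shell before launching applies it to the server process, so no extra command-line flag is needed.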
