brittlewis12
/

QwQ-32B-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

brittlewis12 commited on 5 days ago

Commit

765af50

·

verified ·

1 Parent(s): d140ee8

Update README.md

Files changed (1) hide show

README.md +10 -0

README.md CHANGED Viewed

@@ -47,6 +47,16 @@ using [autogguf-rs](https://github.com/brittlewis12/autogguf-rs).
 ```
 ---
 ## Download & run with [cnvrs](https://twitter.com/cnvrsai) on iPhone, iPad, and Mac!

 ```
+### Note: re: split f16 model files
+To merge the split model files for the f16 precision GGUFs, you can run the `llama-gguf-split` command that comes included when you build llama.cpp & [its examples](https://github.com/ggml-org/llama.cpp/tree/5e43f104cca1a14874e980326a506b44fde022b8/examples/gguf-split).
+It accepts the path to the first of the downloaded splits, assuming the following to be alongside it, and an output path. For example:
+```sh
+ ~/llama.cpp $ ./build/bin/llama-gguf-split --merge ~/Downloads/qwq-32b.f16.split-00001-of-00002.gguf ~/Downloads/qwq-32b.f16.gguf
+```
 ---
 ## Download & run with [cnvrs](https://twitter.com/cnvrsai) on iPhone, iPad, and Mac!