Update README.md
Browse files
README.md
CHANGED
@@ -47,6 +47,16 @@ using [autogguf-rs](https://github.com/brittlewis12/autogguf-rs).
|
|
47 |
|
48 |
```
|
49 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
50 |
---
|
51 |
|
52 |
## Download & run with [cnvrs](https://twitter.com/cnvrsai) on iPhone, iPad, and Mac!
|
|
|
47 |
|
48 |
```
|
49 |
|
50 |
+
### Note: re: split f16 model files
|
51 |
+
|
52 |
+
To merge the split model files for the f16 precision GGUFs, you can run the `llama-gguf-split` command that comes included when you build llama.cpp & [its examples](https://github.com/ggml-org/llama.cpp/tree/5e43f104cca1a14874e980326a506b44fde022b8/examples/gguf-split).
|
53 |
+
|
54 |
+
It accepts the path to the first of the downloaded splits, assuming the following to be alongside it, and an output path. For example:
|
55 |
+
|
56 |
+
```sh
|
57 |
+
~/llama.cpp $ ./build/bin/llama-gguf-split --merge ~/Downloads/qwq-32b.f16.split-00001-of-00002.gguf ~/Downloads/qwq-32b.f16.gguf
|
58 |
+
```
|
59 |
+
|
60 |
---
|
61 |
|
62 |
## Download & run with [cnvrs](https://twitter.com/cnvrsai) on iPhone, iPad, and Mac!
|