brittlewis12 committed on
Commit 765af50 · verified · 1 Parent(s): d140ee8

Update README.md

Files changed (1)
  1. README.md +10 -0
README.md CHANGED
@@ -47,6 +47,16 @@ using [autogguf-rs](https://github.com/brittlewis12/autogguf-rs).
 
  ```
 
+ ### Note: re: split f16 model files
+
+ To merge the split model files for the f16 precision GGUFs, you can run the `llama-gguf-split` command that comes included when you build llama.cpp & [its examples](https://github.com/ggml-org/llama.cpp/tree/5e43f104cca1a14874e980326a506b44fde022b8/examples/gguf-split).
+
+ It accepts the path to the first of the downloaded splits, assuming the following to be alongside it, and an output path. For example:
+
+ ```sh
+ ~/llama.cpp $ ./build/bin/llama-gguf-split --merge ~/Downloads/qwq-32b.f16.split-00001-of-00002.gguf ~/Downloads/qwq-32b.f16.gguf
+ ```
+
  ---
 
  ## Download & run with [cnvrs](https://twitter.com/cnvrsai) on iPhone, iPad, and Mac!
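
Because the merge command added in this commit expects the remaining splits to sit alongside the first one, it may help to confirm they are all downloaded before merging. The following is a minimal sketch, not part of llama.cpp or this repository: `check_splits` is a hypothetical helper, and it assumes the split files follow the `-NNNNN-of-MMMMM.gguf` naming (five-digit, zero-padded counters) seen in the example path.

```sh
# Hypothetical helper (not part of llama.cpp): confirm every split is present
# next to the first one before merging. Assumes names end in
# "-NNNNN-of-MMMMM.gguf" with five-digit, zero-padded counters.
check_splits() {
  first=$1
  case $first in
    *-[0-9][0-9][0-9][0-9][0-9]-of-[0-9][0-9][0-9][0-9][0-9].gguf) ;;
    *) echo "unrecognized split name: $first" >&2; return 2 ;;
  esac
  stem=${first%.gguf}            # drop the extension
  total=${stem##*-of-}           # zero-padded total, e.g. 00002
  prefix=${stem%-?????-of-*}     # everything before "-00001-of-00002"
  count=$((1$total - 100000))    # strip zero padding without octal parsing
  missing=0
  i=1
  while [ "$i" -le "$count" ]; do
    part=$(printf '%s-%05d-of-%s.gguf' "$prefix" "$i" "$total")
    if [ ! -f "$part" ]; then
      echo "missing: $part" >&2
      missing=1
    fi
    i=$((i + 1))
  done
  return "$missing"
}
```

With the function defined in the current shell, `check_splits ~/Downloads/qwq-32b.f16.split-00001-of-00002.gguf && ./build/bin/llama-gguf-split --merge ~/Downloads/qwq-32b.f16.split-00001-of-00002.gguf ~/Downloads/qwq-32b.f16.gguf` would attempt the merge only once both parts are present. The function uses plain POSIX shell constructs, so its variables are not local to the function.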