Felladrin commited on
Commit
9127270
·
verified ·
1 Parent(s): e4e41d5

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +23 -1
README.md CHANGED
@@ -5,4 +5,26 @@ base_model: Isotonic/Mixnueza-6x32M-MoE
5
 
6
  GGUF version of [Isotonic/Mixnueza-6x32M-MoE](https://huggingface.co/Isotonic/Mixnueza-6x32M-MoE).
7
 
8
- It was not possible to quantize the model after converting it to F16/F32 GGUF, so only those versions are available.
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
5
 
6
  GGUF version of [Isotonic/Mixnueza-6x32M-MoE](https://huggingface.co/Isotonic/Mixnueza-6x32M-MoE).
7
 
8
+ It was not possible to quantize the model, so only the F16 and F32 GGUF files are available.
9
+
10
+ ## Try it with [llama.cpp](https://github.com/ggerganov/llama.cpp)
11
+
12
+ ```bash
13
+ brew install ggerganov/ggerganov/llama.cpp
14
+ ```
15
+ ```bash
16
+ llama-cli \
17
+ --hf-repo Felladrin/gguf-Mixnueza-6x32M-MoE \
18
+ --model Mixnueza-6x32M-MoE.F32.gguf \
19
+ --random-prompt \
20
+ --dynatemp-range "0.1-2.5" \
21
+ --top-k 0 \
22
+ --top-p 1 \
23
+ --min-p 0.1 \
24
+ --typical 0.85 \
25
+ --mirostat 2 \
26
+ --mirostat-ent 3.5 \
27
+ --repeat-penalty 1.1 \
28
+ --repeat-last-n -1 \
29
+ -n 256
30
+ ```