Update README.md
README.md CHANGED
@@ -15,7 +15,7 @@ USER: prompt
 ASSISTANT:
 ```
 
-*The quality of the 3-bit model is higher than the 2-bit model, but the interface is slower. The 3-bit model (type q3_K_S) barely fits into 16
+*The quality of the 3-bit model is higher than the 2-bit model, but the interface is slower. The 3-bit model (type q3_K_S) barely fits into 16 GB of RAM, but it works.*
 ```
 llama_model_load_internal: mem required = 15716.00 MB (+ 3124.00 MB per state)
 ```