File size: 1,383 Bytes
cb2fa5f
 
 
 
 
 
 
 
 
 
 
 
 
 
 
e81aef7
 
cb2fa5f
 
 
 
e81aef7
cb2fa5f
 
e81aef7
cb2fa5f
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
---
base_model: knifeayumu/Cydonia-v1.3-Magnum-v4-22B
language:
- en
license: mit
quantized_by: SpongeQuant
tags:
- SpongeQuant
- i1-GGUF
---


Quantized to `i1-GGUF` using [SpongeQuant](https://github.com/SpongeEngine/SpongeQuant), the Oobabooga of LLM quantization. Chat & support at [Sponge Engine](https://discord.gg/azNmr2Gdgy).

<figure>
  <img src="https://huggingface.co/spaces/SpongeEngine/README/resolve/main/095.png" alt="95. Sydney Opera House">
  <figcaption>95. Sydney Opera House</figcaption>
</figure>

<figure>
  <audio controls>
    <source src="https://huggingface.co/spaces/SpongeEngine/README/resolve/main/011.mp3" type="audio/mp3">
    Your browser does not support the audio element.
  </audio>
  <figcaption>11. Johnny B. Goode – Chuck Berry</figcaption>
</figure>

***
### What is a GGUF?
GGUF is a type of file format used for running LLMs (large language models) on different types of computers. It works on both regular processors (CPU) and graphics cards (GPU). Some LLMs need powerful and expensive hardware, but GGUF makes it possible to run them on a wider range of computers, even ones without high-end GPUs. To make this possible, GGUF models use a technique called quantization, which reduces their size and memory usage. This helps them run more efficiently, but at lower settings, the model might lose some accuracy or detail in its responses.