phi-4-GGUF-imatrix / README.md
mcm07's picture
Upload folder using huggingface_hub
9545aa5 verified
---
license: mit
license_link: https://huggingface.co/microsoft/phi-4/resolve/main/LICENSE
language:
- en
tags:
- phi
- phi4
- unsloth
- nlp
- math
- code
- chat
- conversational
base_model: microsoft/phi-4
---
## llama.cpp and ik_llama.cpp imatrix Quantizations of [unsloth/phi-4-GGUF](https://huggingface.co/unsloth/phi-4-GGUF)
Imatrix and quantizations created from https://huggingface.co/unsloth/phi-4-GGUF/phi-4-F16.gguf
Imatrix dataset from [bartowski1182](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8)
## [llama.cpp](https://github.com/ggerganov/llama.cpp) quants
```
phi-4-IQ2_S.gguf
phi-4-IQ3_XS.gguf
phi-4-Q4_K_M.gguf
phi-4-IQ4_XS.gguf
phi-4-IQ4_NL.gguf
```
## [ik_llama.cpp](https://github.com/ikawrakow/ik_llama.cpp) new quants
```
phi-4-IQ4_KS.gguf
phi-4-IQ4_NL_R4.gguf
```
## Credits
llama.cpp, ik_llama.cpp, bartowski, microsoft, unsloth, huggingface