mcm07
/

phi-4-GGUF-imatrix

Inference Endpoints

Model card Files Files and versions Community

phi-4-GGUF-imatrix / README.md

mcm07's picture

Upload folder using huggingface_hub

9545aa5 verified about 2 months ago

|

history blame contribute delete

900 Bytes

	---
	license: mit
	license_link: https://huggingface.co/microsoft/phi-4/resolve/main/LICENSE
	language:
	- en
	tags:
	- phi
	- phi4
	- unsloth
	- nlp
	- math
	- code
	- chat
	- conversational
	base_model: microsoft/phi-4
	---

	## llama.cpp and ik_llama.cpp imatrix Quantizations of [unsloth/phi-4-GGUF](https://huggingface.co/unsloth/phi-4-GGUF)

	Imatrix and quantizations created from https://huggingface.co/unsloth/phi-4-GGUF/phi-4-F16.gguf

	Imatrix dataset from [bartowski1182](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8)

	## [llama.cpp](https://github.com/ggerganov/llama.cpp) quants

	```
	phi-4-IQ2_S.gguf
	phi-4-IQ3_XS.gguf
	phi-4-Q4_K_M.gguf
	phi-4-IQ4_XS.gguf
	phi-4-IQ4_NL.gguf
	```

	## [ik_llama.cpp](https://github.com/ikawrakow/ik_llama.cpp) new quants

	```
	phi-4-IQ4_KS.gguf
	phi-4-IQ4_NL_R4.gguf
	```

	## Credits
	llama.cpp, ik_llama.cpp, bartowski, microsoft, unsloth, huggingface