iMatrix GGUFs for https://huggingface.co/01-ai/Yi-9B-200K

iMatrix generated with Kalomaze's semi-random groups_merged.txt

Included fp16 GGUF as well, just to save a conversion step if anybody else feels like doing additional quants.

Downloads last month
91
GGUF
Model size
8.83B params
Architecture
llama
Inference Examples
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.