--- title: "Metis-0.4 Quantized in GGUF" tags: - GGUF language: en --- ![Image description](https://i.postimg.cc/MGwhtFfF/tsune-fixed.png) # Tsunemoto GGUF's of Metis-0.4 This is a GGUF quantization of Metis-0.4. ## Original Repo Link: [Original Repository](https://huggingface.co/Mihaiii/Metis-0.4) ## Original Model Card: --- This is a merge between Metis-0.3 and Metis-0.1 having Metis-0.1 as base. It was done using [mergekit](https://github.com/cg123/mergekit). It works well with long system prompts. It isn't generic in a sense that it shouldn't be used for story telling, for example, but only for reasoning and text comprehension. This model is trained on a private dataset. The high GSM8K score is **NOT** because of the MetaMath dataset. # Prompt Format: ``` <|system|> {system_message} <|user|> {prompt} <|assistant|> ``` Merge config: ```yaml slices: - sources: - model: Mihaiii/Metis-0.3 layer_range: [0, 32] - model: Mihaiii/Metis-0.1 layer_range: [0, 32] merge_method: slerp base_model: Mihaiii/Metis-0.1 parameters: t: - filter: self_attn value: [0, 0.5, 0.3, 0.7, 1] - filter: mlp value: [1, 0.5, 0.7, 0.3, 0] - value: 0.5 # fallback for rest of tensors dtype: bfloat16 ```