---
base_model:
- Nohobby/MS3-test-Merge-1
- ArliAI/Mistral-Small-24B-ArliAI-RPMax-v1.4
- trashpanda-org/Llama3-24B-Mullein-v1
- TheDrummer/Cydonia-24B-v2
library_name: transformers
tags:
- mergekit
- merge
---
# merge

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the [Linear DELLA](https://arxiv.org/abs/2406.11617) merge method, using [Nohobby/MS3-test-Merge-1](https://huggingface.co/Nohobby/MS3-test-Merge-1) as a base.

### Models Merged

The following models were included in the merge:
* [ArliAI/Mistral-Small-24B-ArliAI-RPMax-v1.4](https://huggingface.co/ArliAI/Mistral-Small-24B-ArliAI-RPMax-v1.4)
* [trashpanda-org/Llama3-24B-Mullein-v1](https://huggingface.co/trashpanda-org/Llama3-24B-Mullein-v1)
* [TheDrummer/Cydonia-24B-v2](https://huggingface.co/TheDrummer/Cydonia-24B-v2)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: ArliAI/Mistral-Small-24B-ArliAI-RPMax-v1.4
    parameters:
      weight: 0.2
      density: 0.7
  - model: trashpanda-org/Llama3-24B-Mullein-v1
    parameters:
      weight: 0.2
      density: 0.7
  - model: TheDrummer/Cydonia-24B-v2
    parameters:
      weight: 0.2
      density: 0.7
merge_method: della_linear
base_model: Nohobby/MS3-test-Merge-1
parameters:
  epsilon: 0.2
  lambda: 1.1
dtype: bfloat16
tokenizer:
  source: base
```
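For intuition on the parameters above: in `della_linear`, each model contributes its parameter-wise difference from the base (a "delta"). Per the DELLA paper, `density` sets the expected fraction of each delta that survives pruning, `epsilon` controls how much the keep probability varies with parameter magnitude (larger-magnitude entries are more likely to be kept), and `lambda` rescales the combined delta before it is added back to the base. Below is a rough, self-contained sketch of that idea for a single tensor; the function name and exact probability assignment are illustrative assumptions, not mergekit's implementation:

```python
import torch

def della_linear_sketch(base, deltas, weights, density=0.7, epsilon=0.2, lam=1.1):
    """Illustrative sketch of linear DELLA merging for one weight tensor.

    `base` is the base-model tensor; `deltas` are (model - base) task vectors.
    Keep probabilities vary around `density` by up to +/- epsilon/2 according
    to each parameter's magnitude rank, survivors are rescaled by 1/p
    (DARE-style), then the deltas are combined linearly and scaled by `lam`.
    """
    merged_delta = torch.zeros_like(base)
    for delta, w in zip(deltas, weights):
        # Rank parameters by magnitude: 0 = smallest, 1 = largest.
        flat = delta.abs().flatten()
        ranks = flat.argsort().argsort().float() / max(flat.numel() - 1, 1)
        # Keep probability grows with magnitude, centered on `density`.
        p_keep = (density + epsilon * (ranks - 0.5)).clamp(0.01, 1.0)
        mask = torch.bernoulli(p_keep).reshape(delta.shape)
        # Rescale survivors so the expected value of the delta is unchanged.
        pruned = delta * mask / p_keep.reshape(delta.shape)
        merged_delta += w * pruned
    return base + lam * merged_delta
```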
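To reproduce the merge, save the configuration above (e.g. as `config.yml`) and run it through mergekit, either with the `mergekit-yaml` CLI (`mergekit-yaml config.yml ./merged-model`) or via the Python API. A minimal sketch, assuming a recent `pip install mergekit` and enough disk space for all four source models; the output path is a placeholder:

```python
import torch
import yaml
from mergekit.config import MergeConfiguration
from mergekit.merge import MergeOptions, run_merge

# Parse the YAML config shown in this card (assumed saved as config.yml).
with open("config.yml", "r", encoding="utf-8") as fp:
    config = MergeConfiguration.model_validate(yaml.safe_load(fp))

run_merge(
    config,
    out_path="./merged-model",          # placeholder output directory
    options=MergeOptions(
        cuda=torch.cuda.is_available(),  # merge on GPU if one is available
        copy_tokenizer=True,             # copy tokenizer from the base model
    ),
)
```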
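Since the card declares `library_name: transformers`, the merged checkpoint loads like any other Mistral-Small-family model. A minimal usage sketch; the repo id below is a placeholder for wherever this merge is stored (local path or Hugging Face repo):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

repo_id = "path/to/this-merge"  # placeholder, not a published repo id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = AutoModelForCausalLM.from_pretrained(
    repo_id,
    torch_dtype=torch.bfloat16,  # matches the dtype the merge was produced in
    device_map="auto",
)

messages = [{"role": "user", "content": "Introduce yourself in one sentence."}]
input_ids = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)
output = model.generate(input_ids, max_new_tokens=128)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```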