---
base_model:
- estrogen/MS2501-24b-Ink-apollo-ep2
- Nohobby/MS3-test-Merge-1
- huihui-ai/Mistral-Small-24B-Instruct-2501-abliterated
- ToastyPigeon/ms3-roselily-rp-v2
- PocketDoc/Dans-DangerousWinds-V1.1.1-24b
library_name: transformers
tags:
- mergekit
- merge
---
# merge

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the [DELLA](https://arxiv.org/abs/2406.11617) merge method using [Nohobby/MS3-test-Merge-1](https://huggingface.co/Nohobby/MS3-test-Merge-1) as a base.

### Models Merged

The following models were included in the merge:
* [estrogen/MS2501-24b-Ink-apollo-ep2](https://huggingface.co/estrogen/MS2501-24b-Ink-apollo-ep2)
* [huihui-ai/Mistral-Small-24B-Instruct-2501-abliterated](https://huggingface.co/huihui-ai/Mistral-Small-24B-Instruct-2501-abliterated)
* [ToastyPigeon/ms3-roselily-rp-v2](https://huggingface.co/ToastyPigeon/ms3-roselily-rp-v2)
* [PocketDoc/Dans-DangerousWinds-V1.1.1-24b](https://huggingface.co/PocketDoc/Dans-DangerousWinds-V1.1.1-24b)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
base_model: Nohobby/MS3-test-Merge-1
parameters:
  epsilon: 0.05
  lambda: 0.9
  int8_mask: true
  rescale: true
  normalize: false
dtype: bfloat16
tokenizer:
  source: base
merge_method: della
models:
  - model: estrogen/MS2501-24b-Ink-apollo-ep2
    parameters:
      weight: [0.1, -0.01, 0.1, -0.02, 0.1]
      density: [0.6, 0.4, 0.5, 0.4, 0.6]
  - model: huihui-ai/Mistral-Small-24B-Instruct-2501-abliterated
    parameters:
      weight: [0.02, -0.01, 0.02, -0.02, 0.01]
      density: [0.45, 0.55, 0.45, 0.55, 0.45]
  - model: ToastyPigeon/ms3-roselily-rp-v2
    parameters:
      weight: [0.01, -0.02, 0.02, -0.025, 0.01]
      density: [0.45, 0.65, 0.45, 0.65, 0.45]
  - model: PocketDoc/Dans-DangerousWinds-V1.1.1-24b
    parameters:
      weight: [0.1, -0.01, 0.1, -0.02, 0.1]
      density: [0.6, 0.4, 0.5, 0.4, 0.6]
```
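
The five-element `weight` and `density` lists in the configuration above are gradients: mergekit treats a list value as anchor points and interpolates between them across the depth of the model. The sketch below is not mergekit's own code; it is a minimal illustration of that idea, assuming plain linear interpolation over evenly spaced anchors. The helper name `expand_gradient` and the layer count of 40 are assumptions made for the example, so check the base model's `config.json` for the real number of layers.

```python
def expand_gradient(anchors, num_layers):
    """Linearly interpolate a short list of anchor values across num_layers layers.

    Hypothetical helper for illustration; anchors are assumed to be evenly
    spaced from the first layer to the last.
    """
    if num_layers == 1 or len(anchors) == 1:
        return [float(anchors[0])] * num_layers
    values = []
    for layer in range(num_layers):
        # Map the layer index onto the anchor index space [0, len(anchors) - 1].
        pos = layer / (num_layers - 1) * (len(anchors) - 1)
        # Index of the anchor to the left, clamped so lo + 1 stays in range.
        lo = min(int(pos), len(anchors) - 2)
        frac = pos - lo
        values.append(anchors[lo] * (1 - frac) + anchors[lo + 1] * frac)
    return values


NUM_LAYERS = 40  # assumption, not taken from the model card

# Gradients copied from the first model entry in the configuration.
weights = expand_gradient([0.1, -0.01, 0.1, -0.02, 0.1], NUM_LAYERS)
density = expand_gradient([0.6, 0.4, 0.5, 0.4, 0.6], NUM_LAYERS)

for layer in (0, 10, 20, 30, 39):
    print(f"layer {layer:2d}: weight={weights[layer]:+.4f} density={density[layer]:.4f}")
```

Read this way, the negative anchors suggest that around those depths a model's delta from the base is subtracted rather than added, and since `normalize` is `false` the per-layer weights are presumably applied without being rescaled to sum to 1.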