---
base_model:
- SicariusSicariiStuff/Negative_LLAMA_70B
- TheDrummer/Nautilus-70B-v0.1
- KaraKaraWitch/Llama-3.X-Workout-70B
- Steelskull/L3.3-Nevoria-R1-70b
- Tarek07/Inception-LLaMa-70B
library_name: transformers
tags:
- mergekit
- merge
---
# MagicalGirl

No image for this model. An auditory replacement has been [provided](https://files.catbox.moe/7z2w06.flac).

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

### Sampling Settings

I tweak sampler settings often, usually because a model is not very creative or is overly verbose. For this model, I landed on the following:

```
Temperature: 1.4
Min P: 0.03
```

This applies retroactively to [KaraKaraWitch/Llama-3.X-Workout-70B](https://huggingface.co/KaraKaraWitch/Llama-3.X-Workout-70B) as well.

## Notes

- Seems to fit my requirements for the most part. I am not sure how others will feel about it, but this is the model I envisioned.
- **Without** a system prompt, **the model can get quite offensive and dark.** Consider writing a simple system prompt before using it.

## Merge Details
### Merge Method

This model was merged using the [SCE](https://arxiv.org/abs/2408.07990) merge method, with [KaraKaraWitch/Llama-3.X-Workout-70B](https://huggingface.co/KaraKaraWitch/Llama-3.X-Workout-70B) as the base.

### Models Merged

The following models were included in the merge:
* [SicariusSicariiStuff/Negative_LLAMA_70B](https://huggingface.co/SicariusSicariiStuff/Negative_LLAMA_70B)
* [TheDrummer/Nautilus-70B-v0.1](https://huggingface.co/TheDrummer/Nautilus-70B-v0.1)
* [Steelskull/L3.3-Nevoria-R1-70b](https://huggingface.co/Steelskull/L3.3-Nevoria-R1-70b)
* [Tarek07/Inception-LLaMa-70B](https://huggingface.co/Tarek07/Inception-LLaMa-70B)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
models:
  - model: SicariusSicariiStuff/Negative_LLAMA_70B
  - model: TheDrummer/Nautilus-70B-v0.1
  - model: Tarek07/Inception-LLaMa-70B
  - model: Steelskull/L3.3-Nevoria-R1-70b
merge_method: sce
base_model: KaraKaraWitch/Llama-3.X-Workout-70B
parameters:
  select_topk: 1.0
dtype: bfloat16
```
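
For anyone wondering what the two values under Sampling Settings actually do, here is a minimal pure-Python sketch of temperature scaling followed by min-p filtering. It is an illustration only, not the code any particular backend runs: the function names `min_p_probs` and `sample_token` are made up for this example, and the order of the two steps (temperature first, then min-p) is an assumption, since inference backends differ on sampler order.

```python
import math
import random


def min_p_probs(logits, temperature=1.4, min_p=0.03):
    """Return renormalised token probabilities after temperature + min-p.

    Hypothetical helper for illustration; assumes temperature is applied
    before the min-p cut-off.
    """
    # Temperature > 1 flattens the distribution, < 1 sharpens it.
    scaled = [x / temperature for x in logits]

    # Numerically stable softmax.
    peak = max(scaled)
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    probs = [e / total for e in exps]

    # Min-p: drop every token whose probability is below
    # min_p times the probability of the most likely token.
    threshold = min_p * max(probs)
    kept = [p if p >= threshold else 0.0 for p in probs]

    # Renormalise what is left so it sums to 1 again.
    kept_total = sum(kept)
    return [p / kept_total for p in kept]


def sample_token(logits, temperature=1.4, min_p=0.03, rng=random):
    """Draw one token index from the filtered distribution."""
    probs = min_p_probs(logits, temperature, min_p)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]


if __name__ == "__main__":
    toy_logits = [5.0, 4.0, -5.0]  # made-up values, not from the model
    print(min_p_probs(toy_logits))
    print(sample_token(toy_logits))
```

With the toy logits above, the third token falls below the min-p threshold and is never sampled, while the high temperature keeps the first two closer together than they would be at 1.0. In practice you would set these values in your frontend or backend rather than implement them yourself; for example, recent versions of `transformers` accept `temperature` and `min_p` as generation arguments when sampling is enabled.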