base_model:
- nothingiisreal/MN-12B-Starcannon-v3
- MarinaraSpaghetti/NemoMix-Unleashed-12B
library_name: transformers
tags:
- mergekit
- merge
license: cc-by-nc-4.0
Starcannon Unleashed 12B v1.0
Introduction
Sample Output
Instruct
Both ChatML and Mistral should work fine. Personally, I tested this using ChatML. I found that I like the model's responses better when I use this format. Try to test it out and observe which one you like best. :D
Settings
I recommend using either of these setings:
ST-formatting-2024-10-29-01.json or ST-formatting-2024-10-29-02.json
IMPORTANT: Master Import in Silly Tavern. Replace "INSERT WORLD HERE" with the world/universe in which your charcater belongs to. If not applicable, just remove that part.
The only difference between the two are their Temperature settings. Again, play around with it to see what suits your taste.
Both are modified version of MarinaraSpaghetti's Mistral-Small-Correct.json, transformed into ChatML.
You can find the original version here: MarinaraSpaghetti/SillyTavern-Settings
Tips
- Example Messeges I find that it's very important and the model really does copy what it's given, so if you want short outputs, make the first message and example messages short, and so on.
Merge Details
This is a merge of pre-trained language models created using mergekit.
Merge Method
This model was merged using the della_linear merge method using G:\text-generation-webui\models\MarinaraSpaghetti_NemoMix-Unleashed-12B as a base.
Models Merged
The following models were included in the merge:
- G:\text-generation-webui\models\Nothingiisreal_MN-12B-Starcannon-v3
Configuration
The following YAML configuration was used to produce this model:
base_model: G:\text-generation-webui\models\MarinaraSpaghetti_NemoMix-Unleashed-12B
dtype: bfloat16
merge_method: della_linear
parameters:
epsilon: 0.05
int8_mask: 1.0
lambda: 1.0
slices:
- sources:
- layer_range: [0, 40]
model: G:\text-generation-webui\models\MarinaraSpaghetti_NemoMix-Unleashed-12B
parameters:
density: 0.65
weight: 0.4
- layer_range: [0, 40]
model: G:\text-generation-webui\models\Nothingiisreal_MN-12B-Starcannon-v3
parameters:
density: 0.55
weight: 0.6