Model Stock: All we need is just a few fine-tuned models (arXiv: 2403.19522)
This is a merge of pre-trained language models created using mergekit.
It uses the new Model Stock merge method; see below for more information!
Feedback on this model is greatly appreciated! I hope this new merge method will be able to fill some of the gaps Miqu has.
Thank you all!
Since it was made from models that use different prompt formats, any of the following should work.
### Instruction:
{system prompt}
### Input:
{prompt}
### Response:
{output}
[INST] {prompt} [/INST]
SYSTEM: <ANY SYSTEM CONTEXT>
USER:
ASSISTANT:
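As a rough illustration, the three prompt layouts above can be assembled with small helper functions. This is only a sketch: the function names are hypothetical, and the exact whitespace and newlines between fields are assumptions, since the card shows only the field order. The layouts resemble the formats commonly called Alpaca-style, Mistral-style, and Vicuna-style.

```python
def instruction_input_prompt(prompt, system_prompt=""):
    """Build the '### Instruction / ### Input / ### Response' layout.

    The model is expected to continue after '### Response:'.
    Newline placement is an assumption, not taken from the card.
    """
    return (
        "### Instruction:\n"
        f"{system_prompt}\n"
        "### Input:\n"
        f"{prompt}\n"
        "### Response:\n"
    )


def inst_prompt(prompt):
    """Build the '[INST] ... [/INST]' layout."""
    return f"[INST] {prompt} [/INST]"


def system_user_prompt(prompt, system_context=""):
    """Build the 'SYSTEM / USER / ASSISTANT' layout.

    The model is expected to continue after 'ASSISTANT:'.
    """
    return f"SYSTEM: {system_context}\nUSER: {prompt}\nASSISTANT:"


if __name__ == "__main__":
    print(inst_prompt("Write a haiku about merging models."))
```

If one layout gives weaker results in your frontend, trying another is cheap, since all three come from the models that went into the merge.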
This model was merged using the Model Stock merge method, with 152334H/miqu-1-70b-sf as the base.
The following models were included in the merge:
- NeverSleep/MiquMaid-v2-70B
- sophosympatheia/Midnight-Miqu-70B-v1.0
- migtissera/Tess-70B-v1.6
The following YAML configuration was used to produce this model:
models:
- model: NeverSleep/MiquMaid-v2-70B
- model: sophosympatheia/Midnight-Miqu-70B-v1.0
- model: migtissera/Tess-70B-v1.6
- model: 152334H/miqu-1-70b-sf
merge_method: model_stock
base_model: 152334H/miqu-1-70b-sf
dtype: bfloat16
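For readers unfamiliar with the model_stock method named in the configuration above, here is a minimal sketch of the interpolation described in the Model Stock paper, applied to a single flattened weight tensor. It is not mergekit's implementation. The function name and the list-based tensors are hypothetical, and estimating the angle as the mean pairwise cosine between offsets from the base is an assumption of this sketch. The idea is to average the fine-tuned weights, then pull the average back toward the base by a ratio t = k·cos θ / (1 + (k − 1)·cos θ), where k is the number of fine-tuned models and θ is the angle between their offsets from the base.

```python
import math
from itertools import combinations


def model_stock_merge(base, finetuned):
    """Merge one flattened weight tensor in the spirit of Model Stock.

    base      -- list of floats, the base model's weights (w_0)
    finetuned -- list of k >= 2 lists of floats, the fine-tuned weights
    Returns (merged_weights, t), where t is the interpolation ratio.
    """
    k = len(finetuned)
    if k < 2:
        raise ValueError("need at least two fine-tuned tensors")

    # Offsets of each fine-tuned tensor from the base weights.
    offsets = [[w - b for w, b in zip(ft, base)] for ft in finetuned]

    def cosine(u, v):
        dot = sum(a * b for a, b in zip(u, v))
        norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
        return dot / norm if norm > 0 else 0.0

    # Assumption: estimate cos(theta) as the mean pairwise cosine similarity.
    pairs = list(combinations(offsets, 2))
    cos_theta = sum(cosine(u, v) for u, v in pairs) / len(pairs)

    # Interpolation ratio from the paper: t = k*cos / (1 + (k-1)*cos).
    # Note: this sketch does not guard against a zero denominator.
    t = k * cos_theta / (1 + (k - 1) * cos_theta)

    # Plain average of the fine-tuned weights, then interpolate toward base.
    avg = [sum(col) / k for col in zip(*finetuned)]
    merged = [t * a + (1 - t) * b for a, b in zip(avg, base)]
    return merged, t


if __name__ == "__main__":
    merged, t = model_stock_merge([0.0, 0.0], [[1.0, 0.0], [0.0, 1.0]])
    print(merged, t)
```

When the fine-tuned offsets point in nearly the same direction, t approaches 1 and the result is close to the plain average; when they are orthogonal, t is 0 and the result falls back to the base weights.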
If you want to support me, you can here.