todo
make a model card and put a cute girl on it
some info
Making this public so it can be tried and possibly merged if desired while I work on getting the energy to write a proper card.
Short list of things to know:
- This is a mix of creative data (RP, story writing, etc.) applied to ToastyPigeon/ms3-roselily-instruct.
- Instruct format: ChatML or Alpaca preferred, Tekken v7 possible
- ChatML tokens were assigned to unused tokens 20 and 21. This leaves all the Tekken tokens intact, so merges w/ Tekken models are feasible
- Instruct-tuning phase did include Tekken v7 so the tokens are initialized and recognized, but I did not continue with it on the creative step because I do not like it for creative stuff (too restrictive with turn order)
- Feels a little less sensitive to samplers than Instruct-based MS3 models, but should probably still be used with conservative samplers
chat templates
You may need to set <|im_end|> and/or </s> as stopping strings depending on which format you're using. The model generates both properly, but tokenizers can be finicky about what they stop on by default.
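If your frontend or backend does not let you set stopping strings, one workaround is to trim the output yourself. This is a minimal sketch in plain Python, not part of the model or any library: `truncate_at_stop` is a hypothetical helper name, and it assumes you receive the generated text as a string that may still contain the end tokens.

```python
def truncate_at_stop(text, stop_strings=("<|im_end|>", "</s>")):
    """Cut `text` at the earliest occurrence of any stop string.

    Hypothetical helper for illustration; the default stop strings are
    the two end tokens named in this card.
    """
    cut = len(text)
    for stop in stop_strings:
        idx = text.find(stop)
        if idx != -1:
            # Keep the earliest match across all stop strings.
            cut = min(cut, idx)
    return text[:cut]


reply = truncate_at_stop("Sure, here you go.<|im_end|>\n<|im_start|>user")
```

Setting the stop strings in your sampler settings is still the better option when available, since it also stops generation early instead of just hiding the extra text.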
Alpaca w/ System
### System:
{system prompt}
### Instruction:
{user message}
### Response:
{model answer}</s>
ChatML
<|im_start|>system
{system prompt}<|im_end|>
<|im_start|>user
{user message}<|im_end|>
<|im_start|>assistant
{model answer}<|im_end|>
The model also saw some completion training in chat mode and adventure mode.
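For anyone building prompts by hand, the two templates above can be assembled roughly like this. It is a sketch, not the official chat template: `build_chatml` and `build_alpaca` are hypothetical helper names, and the exact whitespace between blocks (a single newline here) is an assumption, since the card only shows the overall layout.

```python
def build_chatml(system, turns):
    """Build a ChatML prompt ending with an open assistant turn.

    `turns` is a list of (role, text) pairs with role "user" or "assistant".
    Single-newline separators are an assumption.
    """
    parts = [f"<|im_start|>system\n{system}<|im_end|>\n"]
    for role, text in turns:
        parts.append(f"<|im_start|>{role}\n{text}<|im_end|>\n")
    # Leave the assistant turn open for the model to complete.
    parts.append("<|im_start|>assistant\n")
    return "".join(parts)


def build_alpaca(system, turns):
    """Build an Alpaca-with-system prompt ending with an open response."""
    headers = {"user": "### Instruction:", "assistant": "### Response:"}
    parts = [f"### System:\n{system}\n"]
    for role, text in turns:
        # Past model answers are closed with </s>, as in the template.
        end = "</s>" if role == "assistant" else ""
        parts.append(f"{headers[role]}\n{text}{end}\n")
    parts.append("### Response:\n")
    return "".join(parts)


chatml_prompt = build_chatml("You are a narrator.", [("user", "Begin the story.")])
alpaca_prompt = build_alpaca("You are a narrator.", [("user", "Begin the story.")])
```

Both functions stop right after the assistant/response header so the model writes the next turn; pair them with the matching stop string for the format you pick.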
Model tree for allura-org/MS3-24B-Roselily-Creative
- Base model: mistralai/Mistral-Small-24B-Base-2501
- Finetuned: ToastyPigeon/ms3-roselily-instruct