todo

make a model card and put a cute girl on it

some info

Making this public so it can be tried and possibly merged if desired while I work on getting the energy to write a proper card.

Short list of things to know:

  • This is a bunch of RP, story writing, etc. creative data applied to ToastyPigeon/ms3-roselily-instruct.
  • Instruct format: ChatML or Alpaca preferred, Tekken v7 possible
  • ChatML tokens were assigned to unused tokens 20 and 21, this leaves all the tekken tokens intact so merges w/ tekken models are feasible
  • Instruct-tuning phase did include Tekken v7 so the tokens are initialized and recognized, but I did not continue with it on the creative step because I do not like it for creative stuff (too restrictive with turn order)
  • Feels a little less sensitive to samplers than Instruct-based MS3 models, but should probably still be used with conservative samplers

chat templates

You may need to set <|im_end|> and/or </s> as stopping strings depending on which format you're using, the model generates both properly but tokenizers can be finicky about what they stop on by default

Alpaca w/ System

### System:
{system prompt}

### Instruction:
{user message}

### Response:
{model answer}</s>

ChatML

<|im_start|>system
{system prompt}<|im_end|>
<|im_start|>user
{user message}<|im_end|>
<|im_start|>assistant
{model answer}<|im_end|>

Also saw some completion training in chat mode and adventure mode.

Downloads last month
104
Safetensors
Model size
23.6B params
Tensor type
FP16
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for allura-org/MS3-24B-Roselily-Creative

Finetuned
(2)
this model
Merges
2 models
Quantizations
5 models