allura-org
/

MS3-24B-Roselily-Creative

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

MS3-24B-Roselily-Creative / README.md

ToastyPigeon's picture

Update README.md

d3846c0 verified 9 days ago

|

history blame contribute delete

1.63 kB

	---
	base_model:
	- ToastyPigeon/ms3-roselily-instruct
	library_name: transformers
	tags:
	- mergekit
	- merge

	---
	# todo

	make a model card and put a cute girl on it

	# some info

	Making this public so it can be tried and possibly merged if desired while I work on getting the energy to write a proper card.

	Short list of things to know:
	- This is a bunch of RP, story writing, etc. creative data applied to [ToastyPigeon/ms3-roselily-instruct](https://huggingface.co/ToastyPigeon/ms3-roselily-instruct).
	- Instruct format: ChatML or Alpaca preferred, Tekken v7 possible
	- ChatML tokens were assigned to unused tokens 20 and 21, this leaves all the tekken tokens intact so merges w/ tekken models are feasible
	- Instruct-tuning phase did include Tekken v7 so the tokens are initialized and recognized, but I did not continue with it on the creative step because I do not like it for creative stuff (too restrictive with turn order)
	- Feels a little less sensitive to samplers than Instruct-based MS3 models, but should probably still be used with conservative samplers

	# chat templates

	You may need to set `<\|im_end\|>` and/or `</s>` as stopping strings depending on which format you're using, the model generates both properly but tokenizers can be finicky about what they stop on by default

	Alpaca w/ System
	```
	### System:
	{system prompt}

	### Instruction:
	{user message}

	### Response:
	{model answer}</s>
	```
	ChatML
	```
	<\|im_start\|>system
	{system prompt}<\|im_end\|>
	<\|im_start\|>user
	{user message}<\|im_end\|>
	<\|im_start\|>assistant
	{model answer}<\|im_end\|>
	```
	Also saw some completion training in chat mode and adventure mode.