Update README.md
README.md CHANGED
@@ -1,3 +1,13 @@
+---
+license: apache-2.0
+datasets:
+- ToastyPigeon/roselily-v0
+- PocketDoc/Dans-Systemmaxx
+- allenai/tulu-3-sft-personas-instruction-following
+- ZeusLabs/WizardLM_evol_instruct_fuzzy_dedup_sharegpt
+base_model:
+- mistralai/Mistral-Small-24B-Base-2501
+---
This is a double fine-tuned version of Mistral Small 24B Base 2501.

Stage 1 was shoving 30M tokens of human-written story content into it using completion training ([ToastyPigeon/ms3-base-roselily](https://huggingface.co/ToastyPigeon/ms3-base-roselily)), which is about half of my WIP Roselily dataset (~60M tokens total).

@@ -33,4 +43,4 @@ This model should accept (in theory) any of the following instruct formats:

The Tekken tokens were already in the tokenizer. Unused special tokens #20 and #21 were repurposed for the ChatML tokens. Fizzpaca did not add any new tokens.

-You may need to add both `</s>` and `<|im_end|>` as stop tokens for it to work properly with all formats.
+You may need to add both `</s>` and `<|im_end|>` as stop tokens for it to work properly with all formats.
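
Since the ChatML markers live in repurposed special-token slots, it's worth confirming they encode as single token ids before relying on them. A minimal sketch using `transformers`; the repo id below is a placeholder, not the actual model path:

```python
from transformers import AutoTokenizer

# Placeholder repo id -- substitute the actual model path.
tok = AutoTokenizer.from_pretrained("ToastyPigeon/model-name")

# Each marker should encode to exactly one token id; a multi-id
# result would mean it was not registered as a special token.
for text in ["<|im_start|>", "<|im_end|>", "</s>"]:
    ids = tok.encode(text, add_special_tokens=False)
    print(f"{text!r} -> {ids}")
```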
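And one way to honor the stop-token note above is to pass both end-of-turn markers as EOS ids at generation time. A sketch assuming a plain `transformers` setup; again, the repo id is a placeholder:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "ToastyPigeon/model-name"  # placeholder repo id
tok = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# Treat both end-of-turn markers as EOS so generation stops cleanly
# no matter which instruct format the prompt used.
stop_ids = [tok.convert_tokens_to_ids(t) for t in ["</s>", "<|im_end|>"]]

prompt = "<|im_start|>user\nSay hello.<|im_end|>\n<|im_start|>assistant\n"
inputs = tok(prompt, return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=64, eos_token_id=stop_ids)
print(tok.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```

Inference frontends that take stop strings rather than token ids should get both `</s>` and `<|im_end|>` in their stop list for the same reason.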