This model is an improvement over the old one. It uses the new tokenizer_config.
Our dataset was made from our university's student notebook. It covers majors, university regulations, and other information about our university.
[hcmue_qa](https://huggingface.co/datasets/Tamnemtf/hcmue_qa)
## Instruction Format
To leverage instruction fine-tuning, your prompt should be surrounded by `<|im_start|>` and `<|im_end|>` tokens. The very first instruction should begin with the beginning-of-sentence token id; subsequent instructions should not. The assistant's generation is ended by the end-of-sentence token id.
E.g.
```python
role = "user"
prompt = "hi"
chatml = f"<|im_start|>{role}\n{prompt}<|im_end|>\n"
```
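As a rough sketch, the multi-turn layout described above can be assembled as follows. The `build_chatml` helper and the literal `"<s>"` string are assumptions for illustration only; in practice the beginning-of-sentence id comes from the model's tokenizer config.

```python
# Sketch: assemble a multi-turn ChatML-style prompt.
# Assumption: "<s>" stands in for the beginning-of-sentence token;
# the real special-token ids come from the tokenizer config.
def build_chatml(turns, bos="<s>"):
    """turns: list of (role, text) pairs, e.g. [("user", "hi")]."""
    prompt = bos  # only the very first instruction carries the BOS token
    for role, text in turns:
        prompt += f"<|im_start|>{role}\n{text}<|im_end|>\n"
    return prompt

print(build_chatml([("user", "hi"), ("assistant", "hello")]))
```

Note that the BOS token is emitted once, before the first instruction, while every turn is wrapped in its own `<|im_start|>`/`<|im_end|>` pair.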
Here is the [dataset](https://huggingface.co/datasets/Tamnemtf/hcmue-new-template) after adding this format.
### Training Procedure
```python