Update README.md
Browse files
README.md
CHANGED
@@ -3,7 +3,7 @@ license: cc-by-sa-4.0
|
|
3 |
---
|
4 |
<a href="https://www.buymeacoffee.com/acrastt" target="_blank"><img src="https://cdn.buymeacoffee.com/buttons/v2/default-yellow.png" alt="Buy Me A Coffee" style="height: 60px !important;width: 217px !important;" ></a>
|
5 |
|
6 |
-
This is [StableLM 3B 4E1T](https://huggingface.co/stabilityai/stablelm-3b-4e1t) finetuned on [EverythingLM Data V3(ShareGPT format)](https://huggingface.co/datasets/acrastt/EverythingLM-V3-ShareGPT) for 2 epochs using QLoRA.
|
7 |
|
8 |
Prompt template:
|
9 |
```
|
@@ -19,4 +19,4 @@ Note that this model have the EOS token of `<|endoftext|>` instead of `<\s>`.
|
|
19 |
[StableLM 3B 4E1T](https://huggingface.co/stabilityai/stablelm-3b-4e1t) by [Stability AI](https://stability.ai/) is licensed under [CC BY-SA 4.0](https://creativecommons.org/licenses/by-sa/4.0/).
|
20 |
|
21 |
Modifications:
|
22 |
-
- Instruction tuned on dataset [EverythingLM Data V3(ShareGPT format)](https://huggingface.co/datasets/acrastt/EverythingLM-V3-ShareGPT) for 2 epochs using QLoRA.
|
|
|
3 |
---
|
4 |
<a href="https://www.buymeacoffee.com/acrastt" target="_blank"><img src="https://cdn.buymeacoffee.com/buttons/v2/default-yellow.png" alt="Buy Me A Coffee" style="height: 60px !important;width: 217px !important;" ></a>
|
5 |
|
6 |
+
This is [StableLM 3B 4E1T](https://huggingface.co/stabilityai/stablelm-3b-4e1t) finetuned on [EverythingLM Data V3(ShareGPT format)](https://huggingface.co/datasets/acrastt/EverythingLM-V3-ShareGPT) for 2 epochs using [QLoRA](https://arxiv.org/abs/2305.14314).
|
7 |
|
8 |
Prompt template:
|
9 |
```
|
|
|
19 |
[StableLM 3B 4E1T](https://huggingface.co/stabilityai/stablelm-3b-4e1t) by [Stability AI](https://stability.ai/) is licensed under [CC BY-SA 4.0](https://creativecommons.org/licenses/by-sa/4.0/).
|
20 |
|
21 |
Modifications:
|
22 |
+
- Instruction tuned on dataset [EverythingLM Data V3(ShareGPT format)](https://huggingface.co/datasets/acrastt/EverythingLM-V3-ShareGPT) for 2 epochs using [QLoRA](https://arxiv.org/abs/2305.14314).
|