---
language:
- en
- ko
pipeline_tag: text-generation
inference: false
tags:
- pytorch
- llama
- causal-lm
- 42dot-llm
license: cc-by-nc-4.0
---

# 42dot-SFT 1.3B

**42dot-SFT** is a large language model (LLM) developed by [**42dot**](https://42dot.ai/) that has been trained to follow natural-language instructions; it is derived from **42dot-PLM** by supervised fine-tuning (SFT). This repository contains a 1.3B-parameter version of the model.
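
For quick reference, a minimal usage sketch with 🤗 Transformers is shown below. The repository id `42dot/42dot-sft-1.3b` is taken from the asset URLs in this card; the plain-text prompt is an assumption, since the exact dialogue template used during SFT is not documented here.

```python
# Minimal usage sketch (not an official example): load the checkpoint
# and generate a reply to a single-turn instruction.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "42dot/42dot-sft-1.3b"  # repo id inferred from this card's asset URLs
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

prompt = "What is the capital of South Korea?"  # plain prompt; the SFT template may differ
inputs = tokenizer(prompt, return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=128, do_sample=True, top_p=0.9)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```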

## Model Description

### Hyperparameters

Like 42dot-PLM, the model is built on a Transformer decoder architecture similar to that of [LLaMA 2](https://ai.meta.com/research/publications/llama-2-open-foundation-and-fine-tuned-chat-models/); its hyperparameters are listed below.

| Params | Layers | Attention heads | Hidden size | FFN size | Max. length\* |
| -- | -- | -- | -- | -- | -- |
| 1.3B | 24 | 32 | 2,048 | 5,632 | 8,192 |

(\* unit: tokens)
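
For reference, the table above maps onto a `transformers` `LlamaConfig` roughly as follows. This is only a sketch: fields not listed in the table (such as `vocab_size`) are left at library defaults and may differ from the released checkpoint.

```python
from transformers import LlamaConfig

# Only the fields listed in the hyperparameter table are set here;
# anything else stays at the transformers default.
config = LlamaConfig(
    num_hidden_layers=24,          # Layers
    num_attention_heads=32,        # Attention heads
    hidden_size=2048,              # Hidden size
    intermediate_size=5632,        # FFN size
    max_position_embeddings=8192,  # Max. length (tokens)
)
```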

### Supervised Fine-tuning

Fine-tuning took about 4 hours using 8 NVIDIA A100 GPUs. For the training dataset, we manually constructed (question/instruction, response) pairs, which can be either single- or multi-turn.
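
To make the dataset shape concrete, a record might look like the sketch below. This layout is purely illustrative (the actual schema used by 42dot is not published in this card); a multi-turn pair simply extends the `turns` list.

```python
# Illustrative (hypothetical) record layout for a single-turn pair;
# a multi-turn example appends further user/assistant turns.
record = {
    "turns": [
        {"role": "user", "content": "Recommend three places to visit in Seoul."},
        {"role": "assistant", "content": "Gyeongbokgung Palace, Bukchon Hanok Village, and Namsan Tower."},
    ]
}
```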

### Evaluation

Inspired by recent efforts such as [Vicuna](https://lmsys.org/blog/2023-03-30-vicuna/#how-good-is-vicuna), we evaluate 42dot-SFT against other proprietary and open-source chatbots, using GPT-4 to assess various aspects of the responses; a rough sketch of such a judging call appears after the comparison table below. A sample of the evaluation dataset and the prompt template can be downloaded from our [GitHub repo](https://github.com/42dot/42dot_LLM).

- Baselines:
  - [ChatGPT](https://chat.openai.com/) using GPT-3.5-turbo and GPT-4
  - [Bard](https://bard.google.com/)
  - [KORani-v2-13B](https://huggingface.co/KRAFTON/KORani-v1-13B)

| Model | GPT-3.5 | GPT-4 | Bard | KORani | 42dot-SFT |
| :-- | :--: | :--: | :--: | :--: | :--: |
| Params | Unknown | Unknown | Unknown | 13B | 1.3B |
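
As a rough illustration of this GPT-4-as-judge setup, the sketch below scores a pair of answers with the OpenAI API. The rubric string is an assumption; the actual prompt template ships with the 42dot_LLM GitHub repo linked above.

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Hypothetical rubric; the real template is in the 42dot_LLM repo.
JUDGE_PROMPT = (
    "You are a precise assistant for checking the quality of two AI "
    "assistants' answers.\n\n"
    "[Question]\n{question}\n\n"
    "[Assistant 1]\n{answer_1}\n\n"
    "[Assistant 2]\n{answer_2}\n\n"
    "Rate each answer from 1 to 10. Output the two scores on the first "
    "line separated by a space, then a brief justification."
)

def judge(question: str, answer_1: str, answer_2: str) -> str:
    """Ask GPT-4 to score two candidate answers to the same question."""
    response = client.chat.completions.create(
        model="gpt-4",
        temperature=0,  # make scoring as deterministic as possible
        messages=[{
            "role": "user",
            "content": JUDGE_PROMPT.format(
                question=question, answer_1=answer_1, answer_2=answer_2
            ),
        }],
    )
    return response.choices[0].message.content
```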

<figure align="center">
<img src="https://huggingface.co/42dot/42dot-sft-1.3b/resolve/main/asset/Ko-Score.png" width="90%" height="70%"/>
<figcaption><b>Response quality evaluation result</b></figcaption>
</figure>

<figure align="center">
<img src="https://huggingface.co/42dot/42dot-sft-1.3b/resolve/main/asset/42dot-SFT-vs.png" width="90%" height="70%"/>
<figcaption><b>Comparison between proprietary chatbots and 42dot-SFT</b></figcaption>
</figure>

## Limitations and Ethical Considerations

42dot-SFT shares many well-known limitations of other LLMs. For example, it may generate false or misleading content, since 42dot-SFT is also subject to [hallucination](https://en.wikipedia.org/wiki/Hallucination_(artificial_intelligence)). In addition, 42dot-SFT may generate toxic, harmful, or biased content because web-crawled corpora were used in the pre-training phase. We strongly suggest that users of 42dot-SFT be aware of these limitations and take the necessary steps to mitigate them.

## Disclaimer

The contents generated by the 42dot LLM series ("42dot LLMs") do not necessarily reflect the views or opinions of 42dot Inc. ("42dot"). 42dot disclaims any and all liability to any party for any direct, indirect, implied, punitive, special, incidental, or other consequential damages arising from any use of the 42dot LLMs and their generated contents.

## License

42dot-SFT is licensed under the Creative Commons Attribution-NonCommercial 4.0 (CC BY-NC 4.0) license.

## Citation

```
@misc{42dot2023lm,
  title={42dot LM: Instruction Tuned Large Language Model of 42dot},
  author={Woo-Jong Ryu and Sang-Kil Park and Jinwoo Park and Sungmin Lee and Yongkeun Hwang},
  year={2023},
  url={https://gitlab.42dot.ai/NLP/hyperai/ChatBaker},
  version={pre-release},
}
```