Update README.md
Browse files
README.md
CHANGED
@@ -52,7 +52,7 @@ Finetuned by Mr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company sp
|
|
52 |
CPT(Continue-Pretraining)->SFT->DPO training model based on Mistral-Nemo-Instruct-2407 through 8 H100-80Gs as a Korean language model <br>
|
53 |
It is a model that has been trained to handle Korean-Chinese-English-Japanese cross-training data and 10M korean news corpus and logic judgment data for various tasks to enable cross-fertilization processing and complex Korean logic & math problems. <br>
|
54 |
-Tokenizer uses the base model without word expansion<br>
|
55 |
-
-Models enhanced with high-dimensional analysis of customer reviews and social posts, as well as coding, writing,
|
56 |
-128k-Context Window<br>
|
57 |
-Support for Korean Functioncall and Tool Calling<br>
|
58 |
-Deepspeed Stage=3, use rslora and BAdam Layer Mode<br>
|
|
|
52 |
CPT(Continue-Pretraining)->SFT->DPO training model based on Mistral-Nemo-Instruct-2407 through 8 H100-80Gs as a Korean language model <br>
|
53 |
It is a model that has been trained to handle Korean-Chinese-English-Japanese cross-training data and 10M korean news corpus and logic judgment data for various tasks to enable cross-fertilization processing and complex Korean logic & math problems. <br>
|
54 |
-Tokenizer uses the base model without word expansion<br>
|
55 |
+
-Models enhanced with high-dimensional analysis of customer reviews and social posts, as well as coding, writing, math and decision making<br>
|
56 |
-128k-Context Window<br>
|
57 |
-Support for Korean Functioncall and Tool Calling<br>
|
58 |
-Deepspeed Stage=3, use rslora and BAdam Layer Mode<br>
|