Saxo/Linkbricks-Horizon-AI-Korean-Advanced-12B

Saxo committed on Sep 27, 2024

Commit e5e095a · verified · 1 Parent(s): 247f47f

Update README.md

Files changed (1)

README.md +1 -1

@@ -52,7 +52,7 @@ Finetuned by Mr. Yunsung Ji (Saxo), a data scientist at Linkbricks, a company sp
 CPT(Continue-Pretraining)->SFT->DPO training model based on Mistral-Nemo-Instruct-2407 through 8 H100-80Gs as a Korean language model <br>
 It is a model that has been trained to handle Korean-Chinese-English-Japanese cross-training data and 10M korean news corpus and logic judgment data for various tasks to enable cross-fertilization processing and complex Korean logic & math problems. <br>
 -Tokenizer uses the base model without word expansion<br>
--Models enhanced with high-dimensional analysis of customer reviews and social posts, as well as coding, writing, amth and decision making<br>
 -128k-Context Window<br>
 -Support for Korean Functioncall and Tool Calling<br>
 -Deepspeed Stage=3, use rslora and BAdam Layer Mode<br>

 CPT(Continue-Pretraining)->SFT->DPO training model based on Mistral-Nemo-Instruct-2407 through 8 H100-80Gs as a Korean language model <br>
 It is a model that has been trained to handle Korean-Chinese-English-Japanese cross-training data and 10M korean news corpus and logic judgment data for various tasks to enable cross-fertilization processing and complex Korean logic & math problems. <br>
 -Tokenizer uses the base model without word expansion<br>
+-Models enhanced with high-dimensional analysis of customer reviews and social posts, as well as coding, writing, math and decision making<br>
 -128k-Context Window<br>
 -Support for Korean Functioncall and Tool Calling<br>
 -Deepspeed Stage=3, use rslora and BAdam Layer Mode<br>
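
Note on the `rslora` item in the README context above: rank-stabilized LoRA is generally described as scaling the adapter update by `alpha / sqrt(r)` rather than the `alpha / r` used by standard LoRA. The sketch below only illustrates that difference. The function names and the `alpha` and rank values are hypothetical and are not taken from this repository's training configuration, which the README does not state.

```python
import math


def lora_scale(alpha: float, rank: int) -> float:
    """Standard LoRA scaling factor: alpha / r."""
    return alpha / rank


def rslora_scale(alpha: float, rank: int) -> float:
    """Rank-stabilized LoRA scaling factor: alpha / sqrt(r)."""
    return alpha / math.sqrt(rank)


if __name__ == "__main__":
    alpha = 16.0  # illustrative value, not from the model card
    for rank in (8, 64, 256):
        # The standard factor shrinks linearly with rank, the
        # rank-stabilized factor only with its square root.
        print(rank, lora_scale(alpha, rank), rslora_scale(alpha, rank))
```

With the same `alpha`, the rank-stabilized factor decays more slowly as the rank grows, which is the usual motivation for using it at higher ranks.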