ykhwang committed on
Commit 0af0692 · 1 Parent(s): 13052b8

Update README.md

Files changed (1)
  1. README.md +1 -1
README.md CHANGED
@@ -26,7 +26,7 @@ Like 42dot-PLM, the model is built upon a Transformer decoder architecture
 (\* unit: tokens)
 ### Supervised Fine-tuning
 
-Fine-tuning took about 4 hours using 8 * NVIDIA A100 GPUs. For the training dataset, we manually constructed (question/instruction, response) pairs, which can be either single- or multi-turn.
+Fine-tuning took about 10 hours using 8 * NVIDIA A100 GPUs. For the training dataset, we manually constructed (question/instruction, response) pairs, which can be either single- or multi-turn.
 
 ### Evaluation
 Inspired by recent attempts like [Vicuna](https://lmsys.org/blog/2023-03-30-vicuna/#how-good-is-vicuna), we evaluate 42dot-PLM against other proprietary/open-source chatbots, using GPT-4 to assess various aspects of the responses. A sample of the evaluation dataset and the prompt template can be downloaded from our [GitHub repo](https://github.com/42dot/42dot_LLM).
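The fine-tuning line above refers to manually constructed (question/instruction, response) pairs that may be single- or multi-turn. As a rough sketch of what such records could look like, here is a minimal Python illustration; the field names, role labels, and the `### Human:`/`### Assistant:` markers are assumptions borrowed from common SFT conventions (e.g. Vicuna-style data), not 42dot's published format.

```python
# Illustrative sketch only: field names and prompt markers are assumptions,
# not 42dot's actual data format.

# Single-turn record: one (instruction, response) pair.
single_turn = {
    "conversations": [
        {"role": "human", "text": "What does supervised fine-tuning do?"},
        {"role": "assistant", "text": "It adapts a pretrained LM to follow instructions."},
    ]
}

# Multi-turn record: alternating human/assistant turns.
multi_turn = {
    "conversations": [
        {"role": "human", "text": "Summarize the Transformer decoder."},
        {"role": "assistant", "text": "Stacked masked self-attention and MLP blocks."},
        {"role": "human", "text": "Now in five words."},
        {"role": "assistant", "text": "Attention layers predict next tokens."},
    ]
}

def to_training_text(record: dict) -> str:
    """Flatten a conversation into a single training string.

    The "### Human:"/"### Assistant:" markers follow a common SFT
    convention (e.g. Vicuna-style data) and are assumed here.
    """
    parts = []
    for turn in record["conversations"]:
        tag = "### Human:" if turn["role"] == "human" else "### Assistant:"
        parts.append(f"{tag} {turn['text']}")
    return "\n".join(parts)

print(to_training_text(multi_turn))
```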
 
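The evaluation described in the diff follows the Vicuna recipe: GPT-4 is shown a question plus two chatbots' answers and asked to score them. A minimal sketch of such a judging call, using the OpenAI Python SDK, is below; the `JUDGE_PROMPT` template and the `judge` helper are illustrative assumptions, since the actual prompt template is the one published in the 42dot GitHub repo linked above.

```python
# Illustrative sketch of GPT-4-as-judge scoring; the prompt below is a
# generic stand-in, not the template published in the 42dot GitHub repo.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

JUDGE_PROMPT = """You are an impartial judge. Rate each assistant's answer to
the question below on a 1-10 scale for helpfulness, relevance, accuracy, and
level of detail. Output both scores, then a short justification.

[Question]
{question}

[Assistant A's answer]
{answer_a}

[Assistant B's answer]
{answer_b}
"""

def judge(question: str, answer_a: str, answer_b: str) -> str:
    """Ask GPT-4 to score two chatbot answers to the same question."""
    response = client.chat.completions.create(
        model="gpt-4",
        temperature=0,  # keep the scoring as deterministic as possible
        messages=[{
            "role": "user",
            "content": JUDGE_PROMPT.format(
                question=question, answer_a=answer_a, answer_b=answer_b
            ),
        }],
    )
    return response.choices[0].message.content

# Example usage, with answers collected from the two chatbots under comparison:
# print(judge("Explain beam search.", answer_from_42dot_plm, answer_from_baseline))
```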