iqwiki-kor
/

Llama3.2-3B-MP-RM

Generated from Trainer

Model card Files Files and versions Metrics Training metrics Community

JW17 commited on Oct 3, 2024

Commit

20eefd0

·

verified ·

1 Parent(s): 6ccdd83

End of training

Files changed (2) hide show

README.md +1 -1
config.json +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 # L32-3B-It-E80
-This model is a fine-tuned version of [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct) on an unknown dataset.
 ## Model description

 # L32-3B-It-E80
+This model is a fine-tuned version of [meta-llama/Llama-3.2-3B-Instruct](https://huggingface.co/meta-llama/Llama-3.2-3B-Instruct) on the iqwiki-kor/mhp-108k dataset.
 ## Model description

config.json CHANGED Viewed

@@ -42,6 +42,6 @@
   "tie_word_embeddings": true,
   "torch_dtype": "bfloat16",
   "transformers_version": "4.43.4",
-  "use_cache": false,
   "vocab_size": 128257
 }

   "tie_word_embeddings": true,
   "torch_dtype": "bfloat16",
   "transformers_version": "4.43.4",
+  "use_cache": true,
   "vocab_size": 128257
 }