Update README.md
README.md CHANGED
@@ -96,7 +96,13 @@ from peft import PeftModel
 model_id = "unsloth/gemma-2-9b-it-bnb-4bit"
 peft_model_id = "webbigdata/C3TR-Adapter"
 
-
+# デバイスがbfloat16をサポートしているかどうかを確認 (check whether the device supports bfloat16)
+if torch.cuda.is_available() and torch.cuda.get_device_capability(0)[0] >= 8:
+    dtype = torch.bfloat16
+else:
+    dtype = torch.float16
+
+model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=dtype, device_map="auto")
 model = PeftModel.from_pretrained(model=model, model_id=peft_model_id)
 tokenizer = AutoTokenizer.from_pretrained(model_id)
 tokenizer.pad_token = tokenizer.unk_token

@@ -136,7 +142,7 @@ The prompt format is original.
 Version1とVersion2(システムプロンプト追加)とVersion3(特殊トークン追加)ではプロンプトフォーマットも変わっています。
 The prompt format has changed between Version 1, Version 2 (adds system prompts), and Version 3 (adds special tokens).
 
-
+プロンプトテンプレート内に余分な空白や改行、特殊トークンの漏れはモデルの誤動作(出力が途切れたり繰り返す、余分な文章が付加される等)に繋がるのでテンプレートにミスがないようにしてください
 Extra spaces, line breaks, and omission of special tokens in the prompt template will cause the model to malfunction (output will be truncated or repeated, extra sentences will be added, etc.), so please make sure there are no errors in the template.
 
 Instructionsは"Translate Japanese to English."(日英翻訳)と"Translate English to Japanese."(英日翻訳)の2種類です。
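The capability check added in this commit boils down to a small piece of pure logic: bfloat16 is only available on CUDA devices with compute capability 8 or higher (Ampere and newer); everything else falls back to float16. A minimal sketch of that selection logic, with dtype names as plain strings for clarity (the README's code returns `torch.bfloat16` / `torch.float16` and reads the capability from `torch.cuda.get_device_capability(0)`); the helper name `select_dtype` is hypothetical, not part of the README:

```python
def select_dtype(cuda_available: bool, major_capability: int) -> str:
    """Pick a half-precision dtype for the device at hand.

    bfloat16 requires CUDA compute capability >= 8 (Ampere or newer);
    older CUDA devices and CPU-only setups fall back to float16.
    """
    if cuda_available and major_capability >= 8:
        return "bfloat16"
    return "float16"


print(select_dtype(True, 8))   # Ampere (e.g. A100): bfloat16
print(select_dtype(True, 7))   # Volta/Turing: float16
print(select_dtype(False, 9))  # no CUDA available: float16
```

Using float16 on pre-Ampere GPUs keeps the 4-bit base model loadable there, at the cost of a narrower numeric range than bfloat16.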