Update README.md
README.md
@@ -95,7 +95,7 @@ model = AutoModelForCausalLM.from_pretrained(
     model_id,
     torch_dtype=torch.bfloat16,
     device_map="auto",
-)
+) # We don't recommend using BNB 4-bit (load_in_4bit) here. Instead, use AWQ, as detailed here: https://huggingface.co/scb10x/llama-3-typhoon-v1.5x-70b-instruct-awq

 messages = [...] # add message here

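For context, a minimal sketch of the AWQ path the new comment points to, assuming the `autoawq` package is installed alongside `transformers` (the checkpoint id is taken from the linked model card):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# AWQ checkpoint referenced in the README comment above
model_id = "scb10x/llama-3-typhoon-v1.5x-70b-instruct-awq"

tokenizer = AutoTokenizer.from_pretrained(model_id)

# transformers picks up the AWQ quantization config from the checkpoint,
# so no load_in_4bit / BitsAndBytes flags are needed here.
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.float16,  # AWQ kernels run in fp16
    device_map="auto",
)
```

This mirrors the original snippet except for the checkpoint id and dtype; the quantized weights are detected from the model's own config rather than requested at load time.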