Commit f3d9037 (verified) by Nessii013
1 parent: a0c5835

Create oumi/oumi_infer.yaml

Files changed (1)
  1. oumi/oumi_infer.yaml +24 -0
oumi/oumi_infer.yaml ADDED
@@ -0,0 +1,24 @@
+ # Inference config for Llama 8B Instruct.
+ #
+ # Usage:
+ #   oumi infer -i -c configs/recipes/llama3_1/inference/8b_infer.yaml
+ #
+ # See Also:
+ #   - Documentation: https://oumi.ai/docs/en/latest/user_guides/infer/infer.html
+ #   - Config class: oumi.core.configs.InferenceConfig
+ #   - Config source: https://github.com/oumi-ai/oumi/blob/main/src/oumi/core/configs/inference_config.py
+ #   - Other inference configs: configs/**/inference/
+
+ model:
+   model_name: "uiuc-convai/CALM-8B"
+   adapter_model: null # Update for LoRA-tuned models.
+   model_max_length: 2048 # Update based on your hardware constraints
+   torch_dtype_str: "bfloat16"
+   attn_implementation: "sdpa"
+   load_pretrained_weights: True
+   trust_remote_code: True
+
+ generation:
+   max_new_tokens: 2048
+
+ engine: NATIVE
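
Two values in this config may be worth a second look during review. The following is a minimal, stdlib-only sketch, not part of the commit and not Oumi's own validation. The dictionary mirrors the YAML above by hand, and the helper names `prompt_token_budget` and `review_flags` are hypothetical. The budget check assumes that `model_max_length` caps prompt tokens plus generated tokens, which the diff itself does not confirm.

```python
# Values copied by hand from the oumi/oumi_infer.yaml added in this commit.
config = {
    "model": {
        "model_name": "uiuc-convai/CALM-8B",
        "adapter_model": None,
        "model_max_length": 2048,
        "torch_dtype_str": "bfloat16",
        "attn_implementation": "sdpa",
        "load_pretrained_weights": True,
        "trust_remote_code": True,
    },
    "generation": {"max_new_tokens": 2048},
    "engine": "NATIVE",
}


def prompt_token_budget(cfg: dict) -> int:
    """Tokens left for the prompt.

    ASSUMPTION: model_max_length limits prompt + generated tokens combined.
    """
    max_len = cfg["model"]["model_max_length"]
    max_new = cfg["generation"]["max_new_tokens"]
    return max(0, max_len - max_new)


def review_flags(cfg: dict) -> list[str]:
    """Collect human-readable notes for a reviewer (hypothetical helper)."""
    flags = []
    if prompt_token_budget(cfg) == 0:
        flags.append(
            "max_new_tokens >= model_max_length: no room is left for the "
            "prompt under the stated assumption"
        )
    if cfg["model"]["trust_remote_code"]:
        flags.append(
            "trust_remote_code is enabled: code shipped in the model "
            "repository may run at load time"
        )
    return flags


if __name__ == "__main__":
    for flag in review_flags(config):
        print("-", flag)
```

If the assumption holds, lowering `max_new_tokens` or raising `model_max_length` would leave space for the prompt. Whether `trust_remote_code` is needed depends on the model repository, which this diff does not show.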