
This version of Komodo is Llama-3.2-3B-Instruct fine-tuned on the lighteval/MATH-Hard dataset to improve the base model's math performance.

This model is 4-bit quantized. Load it in 8-bit if you want to use the full 3B parameters. Make sure the 'bitsandbytes' library is installed before loading the model.
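Because loading depends on 'bitsandbytes', it can help to confirm the required packages are present before calling from_pretrained. The following is a minimal sketch using only the standard library; the helper name missing_packages and the REQUIRED list are illustrative assumptions, not part of the model's own tooling.

```python
from importlib.util import find_spec


def missing_packages(names):
    """Return the subset of top-level package names that cannot be found."""
    return [name for name in names if find_spec(name) is None]


# Assumed requirements for loading this checkpoint (hypothetical list).
REQUIRED = ["torch", "transformers", "bitsandbytes"]

if __name__ == "__main__":
    missing = missing_packages(REQUIRED)
    if missing:
        print("Install before loading the model: pip install " + " ".join(missing))
    else:
        print("All required packages are available.")
```

Running this once before the example below gives a clearer message than an import error raised partway through model loading.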

Example Usage:

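from transformers import AutoModelForCausalLM, AutoTokenizer

# Repository id as listed on this model's page (the 3B model).
tokenizer = AutoTokenizer.from_pretrained("suayptalha/Komodo-Llama-3.2-3B")
model = AutoModelForCausalLM.from_pretrained("suayptalha/Komodo-Llama-3.2-3B")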

example_prompt = """Below is a math question and its solution:
Question: {}
Solution: {}"""

inputs = tokenizer(
    [
        example_prompt.format(
            "",  # Question here
            "",  # Solution here (used for training; leave empty for inference)
        )
    ],
    return_tensors="pt",
).to("cuda")

outputs = model.generate(**inputs, max_new_tokens=50, use_cache=True)
print(tokenizer.batch_decode(outputs))
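The prompt template has two slots, and the solution slot appears to be filled only during training. A small sketch of how the prompt could be built for inference and how the generated solution could be pulled out of the decoded text follows. The helper names build_prompt and extract_solution are hypothetical, and splitting on the "Solution:" marker is an assumption about the output format rather than documented behavior.

```python
PROMPT_TEMPLATE = """Below is a math question and its solution:
Question: {}
Solution: {}"""


def build_prompt(question, solution=""):
    """Fill the template; an empty solution leaves that slot for the model."""
    return PROMPT_TEMPLATE.format(question, solution)


def extract_solution(decoded):
    """Return the text after the first 'Solution:' marker, or '' if absent."""
    _, marker, tail = decoded.partition("Solution:")
    return tail.strip() if marker else ""


if __name__ == "__main__":
    prompt = build_prompt("What is 2 + 2?")
    print(prompt)
    # Stand-in for decoded model output: the prompt plus a made-up completion.
    print(extract_solution(prompt + "4"))
```

When decoding real output, passing skip_special_tokens=True to tokenizer.batch_decode keeps special tokens out of the extracted text.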

Model size: 1.85B params (Safetensors). Tensor types: F32, FP16, U8.