metadata
base_model: unsloth/Llama-3.2-1B-Instruct
language:
- en
library_name: transformers
license: llama3.2
tags:
- llama-3
- llama
- meta
- facebook
- unsloth
- transformers
- openvino
- nncf
- 4-bit
This model is a quantized version of unsloth/Llama-3.2-1B-Instruct
and is converted to the OpenVINO format. This model was obtained via the nncf-quantization space with optimum-intel.
First make sure you have optimum-intel
installed:
pip install optimum[openvino]
To load your model you can do as follows:
from optimum.intel import OVModelForCausalLM
model_id = "Anonymous6598/Llama-3.2-1B-Instruct-openvino-4bit"
model = OVModelForCausalLM.from_pretrained(model_id)