FluidInference/phi-4-mini-instruct-int4-ov-npu

Text Generation

bweng committed on Jun 25

Commit 1ffddaa · verified · 1 Parent(s): 8a0fd67

Update README.md

Files changed (1)

README.md +0 -13

@@ -42,19 +42,6 @@ base_model_relation: quantized
 This is [Phi-4-mini-instruct](https://huggingface.co/microsoft/Phi-4-mini-instruct) model converted to the [OpenVINO™ IR](https://docs.openvino.ai/2025/documentation/openvino-ir-format.html) (Intermediate Representation) format with weights compressed to INT4 by [NNCF](https://github.com/openvinotoolkit/nncf).
-## Quantization Parameters
-Weight compression was performed using `nncf.compress_weights` with the following parameters:
-* mode: **INT4_ASYM**
-* ratio: **1.0**
-* group_size: **64**
-* awq: **True**
-* scale_estimation: **True**
-* dataset: [wikitext2](https://huggingface.co/datasets/mindchain/wikitext2)
-For more information on quantization, check the [OpenVINO model optimization guide](https://docs.openvino.ai/2025/openvino-workflow/model-optimization-guide/weight-compression.html)
 ## Compatibility
 The provided OpenVINO™ IR model is compatible with:
 * OpenVINO version 2025.2.0 and higher
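
For reference, the parameters listed in the removed "Quantization Parameters" section map onto keyword arguments of `nncf.compress_weights`. The sketch below is only an illustration of that mapping, not the script used to produce this model. The `compress` helper and its `model` and `calibration_items` arguments are hypothetical names, and the README does not say how the model was loaded or how wikitext2 was preprocessed.

```python
# Parameter values copied from the removed "Quantization Parameters" section.
COMPRESSION_PARAMS = {
    "mode": "INT4_ASYM",
    "ratio": 1.0,
    "group_size": 64,
    "awq": True,
    "scale_estimation": True,
}


def compress(model, calibration_items):
    """Apply NNCF weight compression with the parameters above.

    `model` and `calibration_items` are placeholders (assumptions): an
    OpenVINO model object and an iterable of already-prepared model inputs.
    """
    import nncf  # imported lazily so the parameter table is usable without NNCF

    kwargs = dict(COMPRESSION_PARAMS)
    # Resolve the mode name to the corresponding NNCF enum member.
    kwargs["mode"] = getattr(nncf.CompressWeightsMode, kwargs["mode"])
    # AWQ and scale estimation are data-aware, so a calibration dataset is passed.
    return nncf.compress_weights(
        model, dataset=nncf.Dataset(calibration_items), **kwargs
    )
```

A `ratio` of 1.0 asks NNCF to compress all eligible weight layers to the 4-bit mode, and `group_size` of 64 means quantization parameters are shared across groups of 64 weights.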
