Intel/Qwen2-0.5B-Instruct-int4-sym-AutoRound


---
license: apache-2.0
---
This is an int4 quantization of [Qwen/Qwen2-0.5B-Instruct](https://huggingface.co/Qwen/Qwen2-0.5B-Instruct), using symmetric quantization with group_size 128, generated by the [intel/auto-round](https://github.com/intel/auto-round) algorithm.

It is intended mainly for vLLM unit tests (UT).
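To make "int4, symmetric, group_size 128" concrete, here is a minimal pure-Python sketch of group-wise symmetric 4-bit quantization. It is an illustration only, not the AutoRound implementation: it uses plain round-to-nearest, whereas AutoRound tunes the rounding. The function names (`quantize_group`, `quantize_row`, `dequantize_row`) and the choice to map the largest magnitude in a group to 7 are assumptions made for this example.

```python
from typing import List, Tuple

GROUP_SIZE = 128     # group_size stated in this model card
QMIN, QMAX = -8, 7   # signed 4-bit integer range


def quantize_group(weights: List[float]) -> Tuple[List[int], float]:
    """Symmetric quantization of one group: a single scale, no zero point."""
    max_abs = max(abs(w) for w in weights)
    # Assumption: the largest magnitude maps to QMAX; real kernels may differ.
    scale = max_abs / QMAX if max_abs > 0 else 1.0
    codes = [min(QMAX, max(QMIN, round(w / scale))) for w in weights]
    return codes, scale


def quantize_row(row: List[float], group_size: int = GROUP_SIZE):
    """Split a weight row into consecutive groups and quantize each one."""
    return [
        quantize_group(row[start:start + group_size])
        for start in range(0, len(row), group_size)
    ]


def dequantize_row(groups) -> List[float]:
    """Reconstruct approximate weights as code * scale for every group."""
    return [code * scale for codes, scale in groups for code in codes]


if __name__ == "__main__":
    # Hypothetical toy weights: 256 values, i.e. two groups of 128.
    row = [((i * 37) % 101 - 50) / 100 for i in range(256)]
    groups = quantize_row(row)
    restored = dequantize_row(groups)
    worst = max(abs(a - b) for a, b in zip(row, restored))
    print(f"groups: {len(groups)}, worst absolute error: {worst:.4f}")
```

Each group of 128 weights shares one floating-point scale, and because the scheme is symmetric there is no zero point to store. With round-to-nearest, the reconstruction error per weight is at most half the group's scale.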