Model Details

This is Qwen/QwQ-32B quantized with AutoRound (symmetric quantization) and serialized in the GPTQ format at 2-bit precision (group size 32). The model was created, tested, and evaluated by The Kaitchup. It is compatible with vLLM and Transformers.
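
As a minimal sketch (not an official usage example), the checkpoint can be loaded with Transformers like any other GPTQ model; the prompt and generation settings below are illustrative, and a GPTQ backend (e.g. the gptqmodel package) needs to be installed for 2-bit GPTQ checkpoints:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "kaitchup/QwQ-32B-AutoRoundGPTQ-2bit"

# The quantized weights are handled by the GPTQ backend at load time.
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

# QwQ is a chat/reasoning model, so format the prompt with the chat template.
messages = [{"role": "user", "content": "Briefly explain 2-bit quantization."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

outputs = model.generate(inputs, max_new_tokens=256)
# Decode only the newly generated tokens.
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

For serving, vLLM can load the same checkpoint, e.g. `vllm serve kaitchup/QwQ-32B-AutoRoundGPTQ-2bit`; exact flags depend on your vLLM version and hardware.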


Details on the quantization process and how to use the model are available at The Kaitchup.

  • Developed by: The Kaitchup
  • Language(s) (NLP): English
  • License: Apache 2.0

How to Support My Work

Subscribe to The Kaitchup. Your support helps me continue quantizing and evaluating models for free.


Model tree for kaitchup/QwQ-32B-AutoRoundGPTQ-2bit: Qwen/Qwen2.5-32B (base model) → Qwen/QwQ-32B (fine-tuned) → this model (quantized).
