Generated from https://github.com/yhyu13/AutoGPTQ.git, branch cuda_dev

Original weights: https://huggingface.co/tiiuae/falcon-7b-instruct

Note: at this time, AutoGPTQ does not work correctly with a group size of 128 when evaluating, so this model uses a group size of 64.
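
As a rough illustration of what the group size means here, the sketch below counts how many quantization groups fit along one row of a weight matrix. It assumes a hidden size of 4544 for falcon-7b-instruct (taken from memory of the model's config.json, so please verify it against the original weights); the helper names are hypothetical and not part of AutoGPTQ. The source does not state why group size 128 fails, so the divisibility check is only one possible factor, not a confirmed cause.

```python
# Assumption: 4544 is the hidden size in the falcon-7b-instruct config.json.
# Verify against the original weights before relying on this number.
HIDDEN_SIZE = 4544


def groups_per_row(in_features: int, group_size: int) -> float:
    """Number of quantization groups along a weight matrix's input dimension."""
    return in_features / group_size


def splits_evenly(in_features: int, group_size: int) -> bool:
    """True if the input dimension divides into whole groups of this size."""
    return in_features % group_size == 0


if __name__ == "__main__":
    for group_size in (64, 128):
        print(
            f"group_size={group_size}: "
            f"{groups_per_row(HIDDEN_SIZE, group_size)} groups per row, "
            f"splits evenly: {splits_evenly(HIDDEN_SIZE, group_size)}"
        )
```

Under the assumed hidden size, a group size of 64 yields whole groups while 128 leaves a partial group, which may or may not be related to the evaluation problem noted above.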
