added disable_exllama=true

#1
by freeEDU - opened

Error loading model: Found modules on cpu/disk. Using Exllama or Exllamav2 backend requires all the modules to be on GPU. You can deactivate exllama backend by setting disable_exllama=True in the quantization config object.
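
For anyone hitting the same error on a local copy, here is a minimal sketch of the kind of change the title describes. It assumes the flag lives under a `quantization_config` section of the model's `config.json`, which this thread does not confirm. The helper name `disable_exllama` and the example config contents are hypothetical, for illustration only.

```python
import json


def disable_exllama(config_text: str) -> str:
    """Return the config JSON text with disable_exllama set to true.

    Assumption: the flag belongs in a "quantization_config" section,
    as the error message suggests. The section is created if missing.
    """
    config = json.loads(config_text)
    quant = config.setdefault("quantization_config", {})
    quant["disable_exllama"] = True  # serialized as JSON `true`
    return json.dumps(config, indent=2)


# Hypothetical minimal config, not the real file from this repository.
example = '{"quantization_config": {"bits": 4, "quant_method": "gptq"}}'
print(disable_exllama(example))
```

Existing keys in the section are left untouched, so only the exllama flag changes.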

Ready to merge
This branch is ready to get merged automatically.
