added disable_exllama=true
#1
by
freeEDU
- opened
Error loading model: Found modules on cpu/disk. Using Exllama or Exllamav2 backend requires all the modules to be on GPU.You can deactivate exllama backend by setting disable_exllama=True in the quantization config object