QuantFactory Banner

QuantFactory/gemma-2-27b-it-abliterated-GGUF

This is quantized version of byroneverson/gemma-2-27b-it-abliterated created using llama.cpp

Original Model Card

gemma-2-27b-it-abliterated

Now accepting abliteration requests. If you would like to see a model abliterated, follow me and leave me a message with model link.

This is a new approach for abliterating models using CPU only. I was able to abliterate this model using free kaggle processing with no accelerator.

  1. Obtain refusal direction vector using a quant model with llama.cpp (llama-cpp-python and ggml-python).
  2. Orthogonalize each .safetensors files directly from original repo and upload to a new repo. (one at a time)

Check out the jupyter notebook for details of how this model was abliterated from gemma-2-27b-it.

Logo

Downloads last month
739
GGUF
Model size
27.2B params
Architecture
gemma2

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for QuantFactory/gemma-2-27b-it-abliterated-GGUF

Base model

google/gemma-2-27b
Quantized
(8)
this model