QuantFactory/gemma-2-27b-it-abliterated-GGUF

This is a quantized version of byroneverson/gemma-2-27b-it-abliterated, created using llama.cpp.

Original Model Card

gemma-2-27b-it-abliterated

Now accepting abliteration requests. If you would like to see a model abliterated, follow me and leave me a message with a link to the model.

This is a new approach for abliterating models using only a CPU. I was able to abliterate this model using free Kaggle processing with no accelerator.

Obtain the refusal direction vector using a quantized model with llama.cpp (llama-cpp-python and ggml-python).
Orthogonalize each .safetensors file directly from the original repo and upload it to a new repo, one file at a time.
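
The two steps above can be illustrated with a minimal sketch. It assumes the commonly used difference-of-means formulation of the refusal direction and projects that direction out of a weight matrix whose rows index the residual-stream dimension. The function names, the toy activations, and the use of plain Python lists are illustrative assumptions, not the code from the notebook, which remains the authoritative reference. A real run would operate on tensors loaded from each .safetensors shard.

```python
import math


def unit(v):
    """Scale a vector to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def refusal_direction(harmful_acts, harmless_acts):
    """Assumed formulation: normalized difference of mean activations."""
    dim = len(harmful_acts[0])
    mean_harmful = [sum(a[i] for a in harmful_acts) / len(harmful_acts) for i in range(dim)]
    mean_harmless = [sum(a[i] for a in harmless_acts) / len(harmless_acts) for i in range(dim)]
    return unit([h - b for h, b in zip(mean_harmful, mean_harmless)])


def orthogonalize(weight, direction):
    """Return W - r (r^T W), removing the component of W's output along r.

    `weight` is a list of rows; rows index the same dimension as `direction`.
    """
    rows, cols = len(weight), len(weight[0])
    # r^T W: how strongly each column of W writes along the direction r.
    proj = [sum(direction[i] * weight[i][j] for i in range(rows)) for j in range(cols)]
    return [[weight[i][j] - direction[i] * proj[j] for j in range(cols)] for i in range(rows)]


# Toy example with 2-dimensional activations (hypothetical values).
r = refusal_direction([[2.0, 0.0], [4.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]])
w = [[1.0, 2.0], [3.0, 4.0]]
w_new = orthogonalize(w, r)

# After orthogonalization, W can no longer write along r.
residual = [sum(r[i] * w_new[i][j] for i in range(2)) for j in range(2)]
```

Because each weight matrix is modified independently, the shards can be processed one file at a time, which is what keeps memory use low enough for a CPU-only environment.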

Check out the Jupyter notebook for details of how this model was abliterated from gemma-2-27b-it.

GGUF

Model size: 27.2B params

Architecture: gemma2

Available quantizations: 2-bit, 3-bit, 4-bit, 5-bit, 6-bit, 8-bit

Model tree for QuantFactory/gemma-2-27b-it-abliterated-GGUF

Base model: google/gemma-2-27b

Finetuned: google/gemma-2-27b-it

Finetuned: byroneverson/gemma-2-27b-it-abliterated

Quantized (8): this model