Update README.md
README.md
CHANGED
@@ -66,7 +66,7 @@ By applying refusal directions individually to each layer's tensors:
 The more we force refusal directions onto the model:
 - The more **neutral** it becomes, but at the risk of becoming **dumber**.
 - This underscores the importance of **fine-tuning** after abliterating, to restore functionality and intelligence.
-- So despite the script letting choose a scale factor
+- So despite the script letting the user choose a **scale factor**, too high a value will break the model.
 
 ## Next Steps
 
@@ -74,4 +74,15 @@ The abliterated model serves as a **neutral starting point**. Fine-tuning is ess
 - Adjust the model to reduce over-censoring.
 - Maintain a balance between neutrality and usability.
 
-This is a **work in progress**, Phi 4 is smoll so I can toy with it.
+This is a **work in progress**, Phi 4 is smoll so I can toy with it.
+
+## Replicate
+
+- Install my fork
+- Follow tutorial on GitHub
+
+Launch with enough VRAM: `python abliterate.py -m /workspace/microsoft_phi-4 -o ./perfect --deccp --flash-attn --device auto --scan-all --resume`
+
+If you want to use the tensors available here, just put the `refusal_tensors/` folder at the root of the script
+
+Rename the tensors as needed. My code is shit, please understand, idea is better than code. Do better. kek.
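Reviewer note on the **scale factor** line: here is a minimal sketch of why an oversized scale can break a model. It assumes the common directional-ablation form `W' = W - scale * r (rᵀ W)`, with `r` the unit refusal direction. That formula and the helper names `normalize`, `ablate` and `component_along` are assumptions made for illustration. They are not taken from `abliterate.py`, which may do this differently.

```python
import math


def normalize(v):
    """Return v scaled to unit length."""
    norm = math.sqrt(sum(x * x for x in v))
    return [x / norm for x in v]


def component_along(weight, direction):
    """Per-column component of `weight` along the unit `direction` (r^T W)."""
    r = normalize(direction)
    rows, cols = len(weight), len(weight[0])
    return [sum(r[i] * weight[i][j] for i in range(rows)) for j in range(cols)]


def ablate(weight, direction, scale=1.0):
    """Hypothetical ablation step: W - scale * r (r^T W).

    weight is a list of rows (d_out x d_in); direction has length d_out.
    """
    r = normalize(direction)
    proj = component_along(weight, direction)
    return [
        [weight[i][j] - scale * r[i] * proj[j] for j in range(len(weight[0]))]
        for i in range(len(weight))
    ]


# Toy 2x2 weight matrix and a toy "refusal direction" along the first axis.
W = [[1.0, 2.0], [3.0, 4.0]]
r = [1.0, 0.0]

for scale in (0.5, 1.0, 5.0):
    remaining = component_along(ablate(W, r, scale), r)
    print(f"scale={scale}: component along r = {remaining}")
```

In this toy case, `scale=1.0` removes the component along `r` exactly. A smaller scale leaves part of it. A larger scale overshoots: the component flips sign and grows in magnitude, so the weights are distorted instead of being neutralized. That matches the README's warning that too high a value will break the model.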