Undi95 commited on
Commit
6536ef8
·
verified ·
1 Parent(s): 28e62d7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +13 -2
README.md CHANGED
@@ -66,7 +66,7 @@ By applying refusal directions individually to each layer's tensors:
66
  The more we force refusal directions onto the model:
67
  - The more **neutral** it becomes, but at the risk of becoming **dumber**.
68
  - This underscores the importance of **fine-tuning** after abliterating, to restore functionality and intelligence.
69
- - So despite the script letting choose a scale factor, a too high value will break the model.
70
 
71
  ## Next Steps
72
 
@@ -74,4 +74,15 @@ The abliterated model serves as a **neutral starting point**. Fine-tuning is ess
74
  - Adjust the model to reduce over-censoring.
75
  - Maintain a balance between neutrality and usability.
76
 
77
- This is a **work in progress**, Phi 4 is smoll so I can toy with it.
 
 
 
 
 
 
 
 
 
 
 
 
66
  The more we force refusal directions onto the model:
67
  - The more **neutral** it becomes, but at the risk of becoming **dumber**.
68
  - This underscores the importance of **fine-tuning** after abliterating, to restore functionality and intelligence.
69
+ - So despite the script letting the user choose a **scale factor**, too high value will break the model.
70
 
71
  ## Next Steps
72
 
 
74
  - Adjust the model to reduce over-censoring.
75
  - Maintain a balance between neutrality and usability.
76
 
77
+ This is a **work in progress**, Phi 4 is smoll so I can toy with it.
78
+
79
+ ## Replicate
80
+
81
+ - Install my fork
82
+ - Follow tutorial on github
83
+
84
+ Launch with enough VRAM : python abliterate.py -m /workspace/microsoft_phi-4 -o ./perfect --deccp --flash-attn --device auto --scan-all --resume
85
+
86
+ If you want to use the tensors available here, just put the `refusal_tensors/` folder at the root of the script
87
+
88
+ Rename the tensors as needed. My code is shit, please understand, idea is better than code. Do better. kek.