huggingkot
commited on
Commit
·
5981936
1
Parent(s):
8eff51a
add files
Browse files- .gitattributes +1 -0
- README.md +17 -0
- config.json +3 -0
- generation_config.json +3 -0
- model-00001-of-00002.safetensors +3 -0
- model-00002-of-00002.safetensors +3 -0
- model.safetensors.index.json +3 -0
- special_tokens_map.json +3 -0
- tokenizer.json +3 -0
- tokenizer_config.json +3 -0
.gitattributes
CHANGED
|
@@ -33,3 +33,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
| 33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
| 34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
| 35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
| 36 |
+
*.json filter=lfs diff=lfs merge=lfs -text
|
README.md
ADDED
|
@@ -0,0 +1,17 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
|
| 2 |
+
---
|
| 3 |
+
base_model:
|
| 4 |
+
- mlabonne/NeuralDaredevil-8B-abliterated
|
| 5 |
+
---
|
| 6 |
+
|
| 7 |
+
This is a converted weight from [NeuralDaredevil-8B-abliterated](https://huggingface.co/mlabonne/NeuralDaredevil-8B-abliterated) model in [unsloth 4-bit dynamic quant](https://archive.is/EFz7P) using this [collab notebook](https://colab.research.google.com/drive/1P23C66j3ga49kBRnDNlmRce7R_l_-L5l?usp=sharing).
|
| 8 |
+
|
| 9 |
+
## About this Conversion
|
| 10 |
+
|
| 11 |
+
This conversion uses **Unsloth** to load the model in **4-bit** format and force-save it in the same **4-bit** format.
|
| 12 |
+
|
| 13 |
+
### How 4-bit Quantization Works
|
| 14 |
+
- The actual **4-bit quantization** is handled by **BitsAndBytes (bnb)**, which works under **Torch** via **AutoGPTQ** or **BitsAndBytes**.
|
| 15 |
+
- **Unsloth** acts as a wrapper, simplifying and optimizing the process for better efficiency.
|
| 16 |
+
|
| 17 |
+
This allows for reduced memory usage and faster inference while keeping the model compact.
|
config.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:eda4f92eaba847c1a0db7aad3df4dc1bc8271f8e1cc09fae7b4676466192b058
|
| 3 |
+
size 1195
|
generation_config.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:c72858f1de2c4f4dde5ccbaad42b1740fce3f2490aaf7e1603d0a2eb0f1b018e
|
| 3 |
+
size 220
|
model-00001-of-00002.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:33a27b27770043007d12efbe434c310774de205212f26c6fbe77712ecb4a1681
|
| 3 |
+
size 4652072877
|
model-00002-of-00002.safetensors
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:2c17b8e37a399ec3b94e5a3a035a7334d4747e49d46b8856347312bfd934463d
|
| 3 |
+
size 1050673280
|
model.safetensors.index.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:ecd55482196d05d46c0bff39561b2bac92511222f8fedcb9191c35317daf8029
|
| 3 |
+
size 132271
|
special_tokens_map.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:a40e70b69122f16b8b3295a39bb5e5c2dc0e6159a8182e49452f1e0978187774
|
| 3 |
+
size 350
|
tokenizer.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:3c5cf44023714fb39b05e71e425f8d7b92805ff73f7988b083b8c87f0bf87393
|
| 3 |
+
size 17209961
|
tokenizer_config.json
ADDED
|
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
| 1 |
+
version https://git-lfs.github.com/spec/v1
|
| 2 |
+
oid sha256:bb69dcac6b920c62f44472049c041a95bf65502f1e0f2221077e244a334c8ba5
|
| 3 |
+
size 51060
|