Vui Seng Chua
commited on
Commit
·
edd9ad0
1
Parent(s):
5062f79
Rename README.md
Browse files
README.md
ADDED
@@ -0,0 +1,14 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
|
2 |
+
A set of 50% weight-sparse Llama3.1-8B pruned by [Wanda](https://github.com/locuslab/wanda).
|
3 |
+
Model links are in the table below. Models can be loaded as is with Huggingface Transformers.
|
4 |
+
|
5 |
+
### Perplexity
|
6 |
+
![Perplexity over Sparsity](llama3.1-8B_Wanda_sparsity.png)
|
7 |
+
### MMLU (5-shot)
|
8 |
+
| MMLU (5-shot) | Accuracy (%) | Relative to Dense (%) | Model Link|
|
9 |
+
|----------------|--------------|-----------------------|-----------|
|
10 |
+
| Dense | 65.1 | baseline | [Meta-Llama-3.1-8B-wanda-unstructured-0.0](https://huggingface.co/vuiseng9/Meta-Llama-3.1-8B-wanda-unstructured-0.0) |
|
11 |
+
| Unstructured | 50.0 | -15.1 | [Meta-Llama-3.1-8B-wanda-unstructured-0.5](https://huggingface.co/vuiseng9/Meta-Llama-3.1-8B-wanda-unstructured-0.5) |
|
12 |
+
| 4:8 | 39.3 | -25.8 | [Meta-Llama-3.1-8B-wanda-4of8](https://huggingface.co/vuiseng9/Meta-Llama-3.1-8B-wanda-4of8) |
|
13 |
+
| 2:4 | 28.7 | -36.4 | [Meta-Llama-3.1-8B-wanda-2of4](https://huggingface.co/vuiseng9/Meta-Llama-3.1-8B-wanda-2of4) |
|
14 |
+
|