mapama247
commited on
Commit
·
d3157b9
1
Parent(s):
f884579
add num query groups to table
Browse files
README.md
CHANGED
@@ -83,6 +83,7 @@ The full list of hyperparameters for each model can be found [here](https://gith
|
|
83 |
| Layer normalization | RMS Norm |
|
84 |
| Flash attention | ✅ |
|
85 |
| Grouped Query Attention | ✅ |
|
|
|
86 |
|
87 |
---
|
88 |
|
@@ -629,4 +630,4 @@ The Barcelona Supercomputing Center, as the owner and creator of the model, shal
|
|
629 |
|:---:|:---:|:---:|
|
630 |
|2B| WiP | WiP |
|
631 |
|7B| [Link](https://huggingface.co/projecte-aina/salamandra-7b) | [Link](https://huggingface.co/projecte-aina/salamandra-7b-instruct) |
|
632 |
-
|40B| WiP | WiP |
|
|
|
83 |
| Layer normalization | RMS Norm |
|
84 |
| Flash attention | ✅ |
|
85 |
| Grouped Query Attention | ✅ |
|
86 |
+
| Num. query groups | 8 |
|
87 |
|
88 |
---
|
89 |
|
|
|
630 |
|:---:|:---:|:---:|
|
631 |
|2B| WiP | WiP |
|
632 |
|7B| [Link](https://huggingface.co/projecte-aina/salamandra-7b) | [Link](https://huggingface.co/projecte-aina/salamandra-7b-instruct) |
|
633 |
+
|40B| WiP | WiP |
|