mapama247 commited on
Commit
d3157b9
·
1 Parent(s): f884579

add num query groups to table

Browse files
Files changed (1) hide show
  1. README.md +2 -1
README.md CHANGED
@@ -83,6 +83,7 @@ The full list of hyperparameters for each model can be found [here](https://gith
83
  | Layer normalization | RMS Norm |
84
  | Flash attention | ✅ |
85
  | Grouped Query Attention | ✅ |
 
86
 
87
  ---
88
 
@@ -629,4 +630,4 @@ The Barcelona Supercomputing Center, as the owner and creator of the model, shal
629
  |:---:|:---:|:---:|
630
  |2B| WiP | WiP |
631
  |7B| [Link](https://huggingface.co/projecte-aina/salamandra-7b) | [Link](https://huggingface.co/projecte-aina/salamandra-7b-instruct) |
632
- |40B| WiP | WiP |
 
83
  | Layer normalization | RMS Norm |
84
  | Flash attention | ✅ |
85
  | Grouped Query Attention | ✅ |
86
+ | Num. query groups | 8 |
87
 
88
  ---
89
 
 
630
  |:---:|:---:|:---:|
631
  |2B| WiP | WiP |
632
  |7B| [Link](https://huggingface.co/projecte-aina/salamandra-7b) | [Link](https://huggingface.co/projecte-aina/salamandra-7b-instruct) |
633
+ |40B| WiP | WiP |