Text Generation
Transformers
Safetensors
English
stripedhyena
custom_code
Zymrael commited on
Commit
51d76d4
·
1 Parent(s): f6851b9

chore: add info on dtypes

Browse files
Files changed (1) hide show
  1. README.md +1 -0
README.md CHANGED
@@ -32,3 +32,4 @@ StripedHyena is a hybrid architecture composed of multi-head, grouped-query atte
32
 
33
  To use StripedHyena outside of the playground, you will need to install custom kernels. Please follow the instructions from the [standalone repository](https://github.com/togethercomputer/stripedhyena).
34
 
 
 
32
 
33
  To use StripedHyena outside of the playground, you will need to install custom kernels. Please follow the instructions from the [standalone repository](https://github.com/togethercomputer/stripedhyena).
34
 
35
+ StripedHyena is a mixed precision model. Make sure to keep your `poles` and `residues` in `float32` precision, especially for longer prompts or training.