fla-hub/gsa-1.3B-100B
Text Generation
Safetensors
cerebras/SlimPajama-627B
English
fla
gsa
arxiv: 2409.07146
License: mit
nielsr (HF staff) committed on Sep 21, 2024
Commit dfecf91 · verified · 1 Parent(s): 2c8b93b
Link model to paper
This PR links the model to the paper page.
Files changed (1)
README.md +1 -0
README.md ADDED
@@ -0,0 +1 @@
+Model of the paper [Gated Slot Attention for Efficient Linear-Time Sequence Modeling](https://huggingface.co/papers/2409.07146).