gsa-1.3B-100B / README.md
This model accompanies the paper [Gated Slot Attention for Efficient Linear-Time Sequence Modeling](https://huggingface.co/papers/2409.07146).