Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
fla-hub
/
gsa-1.3B-100B
like
0
Follow
fla-hub
36
Text Generation
Safetensors
cerebras/SlimPajama-627B
English
fla
gsa
arxiv:
2409.07146
License:
mit
Model card
Files
Files and versions
Community
1
main
gsa-1.3B-100B
Commit History
Upload GSAForCausalLM
1e4ffda
verified
yzhangcs
commited on
16 days ago
Remove the `norm_first` option
7c18483
yzhangcs
commited on
19 days ago
Update README.md
39d60a1
verified
yzhangcs
commited on
Sep 30, 2024
Update README.md
781baa4
verified
yzhangcs
commited on
Sep 30, 2024
Link model to paper (
#1
)
63d0c7d
verified
yzhangcs
nielsr
HF staff
commited on
Sep 22, 2024
Update tokenizer_config.json
2c8b93b
verified
yzhangcs
commited on
Sep 2, 2024
Upload GSAForCausalLM
023f4e2
verified
yzhangcs
commited on
Jun 7, 2024
initial commit
f060f9a
verified
yzhangcs
commited on
Jun 7, 2024