Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
assafbk
/
mamba-130m-squad-doc-ret
like
0
Text Generation
Transformers
PyTorch
mamba
long context
arxiv:
2406.14528
License:
mit
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
32ce1a1
mamba-130m-squad-doc-ret
/
config.json
jon12398
added model
08cc9a8
8 months ago
raw
Copy download link
history
blame
Safe
165 Bytes
{
"d_model"
:
768
,
"n_layer"
:
24
,
"vocab_size"
:
50277
,
"ssm_cfg"
:
{
}
,
"rms_norm"
:
true
,
"residual_in_fp32"
:
true
,
"fused_add_norm"
:
true
,
"pad_vocab_size_multiple"
:
8
}