Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
SQCU
/
pgptlformer-tinystories
like
0
roneneldan/TinyStories
Model card
Files
Files and versions
Community
main
pgptlformer-tinystories
/
dyn_qkrmsnorm_ii-7a038ecd-be98-46cb-abe8-e0f013fd7eed
1 contributor
History:
1 commit
SQCU
sling the illustrious and mysterious "attention_II" models. also some layerwise rmsnorm, qkprojection rmsnorm models, one twice as large as the other.
1f45909
verified
about 2 months ago
state_step006250.pt
pickle
Detected Pickle imports (4)
"collections.OrderedDict"
,
"torch.FloatStorage"
,
"torch.ByteStorage"
,
"torch._utils._rebuild_tensor_v2"
What is a pickle import?
335 MB
LFS
sling the illustrious and mysterious "attention_II" models. also some layerwise rmsnorm, qkprojection rmsnorm models, one twice as large as the other.
about 2 months ago