useful sharded checkpoints for users to run inference / fine-tuning on a Google colab without having to deal with CPU OOM issues.
Younes B
ybelkada
AI & ML interests
Large Language Models, Quantization, Vision, Multimodality, Diffusion models
Recent Activity
updated
a model
about 20 hours ago
tiiuae/dense-3b-arch1
updated
a model
about 20 hours ago
tiiuae/dense-3b-arch2
new activity
about 20 hours ago
tiiuae/dense-3b-arch2:Create config.json