Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
QuantFactory
/
Llama-3.1-Minitron-4B-Width-Base-GGUF
like
1
Follow
Quant Factory
372
GGUF
Inference Endpoints
arxiv:
2408.11796
arxiv:
2009.03300
arxiv:
2407.14679
License:
nvidia-open-model-license
Model card
Files
Files and versions
Community
Deploy
Use this model
8dbd08c
Llama-3.1-Minitron-4B-Width-Base-GGUF
1 contributor
History:
4 commits
aashish1904
Upload Llama-3.1-Minitron-4B-Width-Base.Q4_1.gguf with huggingface_hub
8dbd08c
verified
6 months ago
.gitattributes
Safe
1.68 kB
Upload Llama-3.1-Minitron-4B-Width-Base.Q4_1.gguf with huggingface_hub
6 months ago
Llama-3.1-Minitron-4B-Width-Base.Q4_1.gguf
Safe
2.91 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base.Q4_1.gguf with huggingface_hub
6 months ago
Llama-3.1-Minitron-4B-Width-Base.Q4_K_M.gguf
Safe
2.78 GB
LFS
Upload Llama-3.1-Minitron-4B-Width-Base.Q4_K_M.gguf with huggingface_hub
6 months ago
README.md
Safe
6.19 kB
Upload README.md with huggingface_hub
6 months ago