iMatrix GGUFs for https://huggingface.co/nvidia/Llama-3.1-Nemotron-70B-Instruct-HF

iMat generated using Kalomaze's groups_merged.txt

Downloads last month
72
GGUF
Model size
70.6B params
Architecture
llama
Hardware compatibility
Log In to view the estimation
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for MarsupialAI/Llama-3.1-Nemotron-70B-Instruct_iMat_GGUF

Quantized
(109)
this model

Dataset used to train MarsupialAI/Llama-3.1-Nemotron-70B-Instruct_iMat_GGUF