QuantFactory
/

CodeLlama-7b-hf-GGUF

Text Generation

text-generation-inference

Model card Files Files and versions

CodeLlama-7b-hf-GGUF

Quantized version of CodeLlama-7b-hf
Created using llama.cpp

Available Quants

Q2_K
Q3_K_L
Q3_K_M
Q3_K_S
Q4_0
Q4_K_M
Q4_K_S
Q5_0
Q5_K_M
Q5_K_S
Q6_K
Q8_0

ReadMe format inspired from mlabonne

Downloads last month: 78

GGUF

Model size

7B params

Architecture

llama

Hardware compatibility

Log In to view the estimation

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Model tree for QuantFactory/CodeLlama-7b-hf-GGUF

Base model

codellama/CodeLlama-7b-hf

Quantized

(17)

this model