RedHatAI
/

Llama-3.1-8B-Instruct-FP8-block

Text Generation

compressed-tensors

Model card Files Files and versions

Llama-3.1-8B-Instruct-FP8-block

9.1 GB

3 contributors

History: 20 commits

alexmarques's picture

Update README.md

b4040dc verified about 1 month ago

.gitattributes

1.61 kB

Add Llama 3.1 8B Instruct FP8-block model weights and tokenizer about 2 months ago
README.md

7.42 kB

Update README.md about 1 month ago
chat_template.jinja

4.61 kB

Add FP8 block quantized model weights about 2 months ago
config.json

2.09 kB
xet

Add FP8 block quantized model weights about 2 months ago
generation_config.json

184 Bytes
xet

Add FP8 block quantized model weights about 2 months ago
model-00001-of-00002.safetensors

5 GB
LFS

Add FP8 block quantized model weights about 2 months ago
model-00002-of-00002.safetensors

4.08 GB
LFS

Add FP8 block quantized model weights about 2 months ago
model.safetensors.index.json

43.5 kB
LFS

Add FP8 block quantized model weights about 2 months ago
recipe.yaml

134 Bytes

Add FP8 block quantized model weights about 2 months ago
special_tokens_map.json

296 Bytes
LFS

Add FP8 block quantized model weights about 2 months ago
tokenizer.json

17.2 MB
LFS

Add FP8 block quantized model weights about 2 months ago
tokenizer_config.json

50.5 kB
LFS

Add FP8 block quantized model weights about 2 months ago