This model was converted to FP8 format from watt-ai/watt-tool-8B using the llmcompressor library by vLLM. Refer to the original model card for more details on the model.

Downloads last month: 105

Safetensors

Model size

8.03B params

Tensor type

BF16

F8_E4M3

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

The model cannot be deployed to the HF Inference API: The model has no library tag.

Model tree for tolgaakar/watt-tool-8B-FP8-Dynamic

Base model

meta-llama/Llama-3.1-8B

Finetuned

meta-llama/Llama-3.1-8B-Instruct

Finetuned

watt-ai/watt-tool-8B

Quantized

(14)

this model