jiangchengchengNLP/L3.3-MS-Nevoria-70b-FP8

This is a static FP8 checkpoint quantized with tensor-model-optimizer. It supports inference with vLLM and SGLang.
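As a rough illustration of the vLLM path, the sketch below loads the checkpoint through vLLM's offline `LLM` API. It is a minimal sketch under stated assumptions, not a tested recipe for this model: the helper names `build_engine_kwargs` and `generate` are hypothetical, the `tensor_parallel_size` default is a guess that depends on your GPUs, and it assumes vLLM picks up the quantization settings from the checkpoint's own config.

```python
MODEL_ID = "jiangchengchengNLP/L3.3-MS-Nevoria-70b-FP8"


def build_engine_kwargs(tensor_parallel_size: int = 4) -> dict:
    """Collect keyword arguments for vllm.LLM (hypothetical helper)."""
    return {
        "model": MODEL_ID,
        # Assumption: 4 GPUs. Adjust to the hardware actually available.
        "tensor_parallel_size": tensor_parallel_size,
    }


def generate(prompt: str, max_tokens: int = 64) -> str:
    """Run one prompt through the model and return the generated text."""
    # Imported lazily so build_engine_kwargs can be used without vLLM installed.
    from vllm import LLM, SamplingParams

    llm = LLM(**build_engine_kwargs())
    params = SamplingParams(temperature=0.7, max_tokens=max_tokens)
    outputs = llm.generate([prompt], params)
    # Each RequestOutput holds a list of completions; take the first one.
    return outputs[0].outputs[0].text
```

Calling `generate("Hello")` would download the weights and start the engine, so it needs enough GPU memory for a 70.6B-parameter FP8 model.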

Safetensors
Model size: 70.6B params
Tensor type: BF16, F8_E4M3
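To make the "static FP8" and `F8_E4M3` labels concrete, here is a small sketch of how a static per-tensor scale is commonly derived: the scale is fixed ahead of time from a calibration maximum rather than computed per batch at inference time. This is an assumption about the general technique, not a description of how this checkpoint was actually produced; the function names are hypothetical, and rounding to the FP8 grid is deliberately left out.

```python
E4M3_MAX = 448.0  # largest finite value in the FP8 E4M3 (E4M3FN) format


def static_scale(calibration_amax: float) -> float:
    """Per-tensor scale fixed from an absolute maximum seen during calibration."""
    return calibration_amax / E4M3_MAX


def fake_quantize(values, scale):
    """Scale into the E4M3 range, clamp, and scale back.

    Rounding to representable FP8 values is omitted, so only the
    clamping effect of a fixed scale is shown.
    """
    out = []
    for v in values:
        q = max(-E4M3_MAX, min(E4M3_MAX, v / scale))
        out.append(q * scale)
    return out
```

With a fixed scale, any value larger than the calibration maximum is clipped, which is why the calibration data should resemble the inputs seen at inference time.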

Model tree for jiangchengchengNLP/L3.3-MS-Nevoria-70b-FP8

This model is one of 28 quantized variants of its base model.