This is a checkpoint for quantization using tensor-model-optimizer, it is a static FP8 checkpoint, supporting vllm, sglang inference.
- Downloads last month
- 11
Model tree for jiangchengchengNLP/L3.3-MS-Nevoria-70b-FP8
Base model
Steelskull/L3.3-MS-Nevoria-70b