INT8 ONNX version of Felladrin/TinyMistral-248M-Chat-v2 to use with Transformers.js.

Downloads last month
89
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API has been turned off for this model.

Model tree for Felladrin/onnx-TinyMistral-248M-Chat-v2

Quantized
(12)
this model