Text Generation
Transformers
Safetensors
mistral
4-bit precision
AWQ
Inference Endpoints
chatml
text-generation-inference
awq
File size: 82 Bytes
d268c49
 
 
 
 
 
1
2
3
4
5
6
{
  "zero_point": true,
  "q_group_size": 128,
  "w_bit": 4,
  "version": "GEMM"
}