Can this model run with `ollama` in `pure cpu` mode?
#7 opened 1 day ago
by
ice6
Add `quantization_config` in config.json?
4
#4 opened 6 days ago
by
WeiwenXia
sglang reports an OOM error after running channel INT8
1
#3 opened 8 days ago
by
zhangneilc
Difference with Block-wise Int8?
1
#1 opened 13 days ago
by
leo98xh