npc0
/

chatglm3-6b-int4

Model card Files Files and versions Community

npc0 commited on Nov 1, 2023

Commit

e217685

·

1 Parent(s): e79db2f

Update README.md

Files changed (1) hide show

README.md +8 -1

README.md CHANGED Viewed

@@ -14,7 +14,14 @@ ChatGLM3-6B 是 ChatGLM 系列最新一代的开源模型，[THUDM/chatglm3-6b](
 用 [ChatGLM.CPP]() 基於 GGML quantize 生成 Q4_0、Q4_1 權重 weights 儲存於此倉庫。
-## Use in Python
 1. Install dependency
   ```sh
   pip install chatglm-cpp transformers

 用 [ChatGLM.CPP]() 基於 GGML quantize 生成 Q4_0、Q4_1 權重 weights 儲存於此倉庫。
+## Performance
+|Model                 |GGML quantize method| HDD size |1 token\*|
+|----------------------|--------------------|----------|---------|
+|chatglm3-ggml-q4_0.bin|        q4_0        |  3.51 GB |   74ms  |
+|chatglm3-ggml-q4_1.bin|        q4_1        |  3.9 GB  |   77ms  |
+\* ms/token (CPU @ Platinum 8260) from [reference](https://github.com/li-plus/chatglm.cpp#performance)
+## Getting Started
 1. Install dependency
   ```sh
   pip install chatglm-cpp transformers