Ba2han commited on
Commit
41c1b9d
·
verified ·
1 Parent(s): d880956

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -1
README.md CHANGED
@@ -7,4 +7,6 @@ base_model:
7
  - Extracted a 64 Rank Lora from DeepSeek-R1-Distill-Qwen-32B
8
  - Merged & Quantized into Q4_K_M
9
 
10
- ### Note: The model seems to be somewhat working with the R1's weird template too but it repeats random Chinese characters and the quality seems to be consistently worse.
 
 
 
7
  - Extracted a 64 Rank Lora from DeepSeek-R1-Distill-Qwen-32B
8
  - Merged & Quantized into Q4_K_M
9
 
10
+ ### Note: The model seems to be somewhat working with the R1's weird template too but it repeats random Chinese characters and the quality seems to be consistently worse.
11
+
12
+ ### Maybe try using the R1 tokenizer.