Ba2han/qwen-coder-thinker-q4_k_m

Ba2han committed 16 days ago

Commit 41c1b9d · verified · 1 Parent(s): d880956

Update README.md

Files changed (1)

README.md +3 -1

@@ -7,4 +7,6 @@ base_model:
 - Extracted a 64 Rank Lora from DeepSeek-R1-Distill-Qwen-32B
 - Merged & Quantized into Q4_K_M
-### Note: The model seems to be somewhat working with the R1's weird template too but it repeats random Chinese characters and the quality seems to be consistently worse.
+### Note: The model seems to be somewhat working with the R1's weird template too but it repeats random Chinese characters and the quality seems to be consistently worse.
+### Maybe try using the R1 tokenizer.