Text Generation
GGUF
English
Chinese
MOE
Qwen 2.5 MOE
Mixture of Experts
6X1.5B
deepseek
reasoning
thinking
creative
128k context
general usage
problem solving
brainstorming
solve riddles
story generation
plot generation
storytelling
fiction story
story
writing
fiction
Qwen 2.5
mergekit
Inference Endpoints
conversational
Update README.md
README.md CHANGED
@@ -77,7 +77,12 @@ This model is also mastered in Float 32, which helped overall model generation a
 
 Temp of .4 to .8 is suggested, however it will still operate at much higher temps like 1.8, 2.6 etc.
 
-
+Depending on your prompt, change the temp SLOWLY: i.e. .41, .42, .43, etc.
+
+Likewise, because these are small models, they may do a lot of "thinking"/"reasoning" and then "forget" to finish the task(s). In
+this case, prompt the model to "Complete the task XYZ with the 'reasoning plan' above".
+
+Also set the context limit to 4k minimum; 8K+ is suggested.
 
 Quants uploaded: Q4_K_S, Q8_0
 
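To make the settings from this commit concrete, below is a minimal sketch (not from the model card) using llama-cpp-python to load one of the uploaded quants with an 8K context and a temperature in the suggested .4 to .8 range. The model path, prompt, and follow-up wording are placeholders/assumptions, not values taken from the repository.

```python
# Minimal sketch, assuming llama-cpp-python and a locally downloaded quant.
from llama_cpp import Llama

llm = Llama(
    model_path="./model-Q4_K_S.gguf",  # hypothetical path; use the Q4_K_S or Q8_0 file you downloaded
    n_ctx=8192,                        # context limit: 4k minimum, 8K+ suggested
)

prompt = "Solve the riddle: what has keys but opens no locks?"

# Start in the suggested .4-.8 range; adjust the temp slowly (.41, .42, .43, ...).
out = llm(prompt, max_tokens=512, temperature=0.6)
text = out["choices"][0]["text"]
print(text)

# If the model writes a long "reasoning plan" but stops before answering,
# re-prompt it to finish, as the card suggests:
followup = (
    prompt + "\n" + text +
    "\nComplete the task above with the 'reasoning plan' above."
)
print(llm(followup, max_tokens=512, temperature=0.6)["choices"][0]["text"])
```

If the first pass already finishes the task, the follow-up call is unnecessary; it is only a fallback for the "thinks but forgets to answer" case described above.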