cgus
/

SOLAR-10.7B-Instruct-v1.0-128k-GGUF

Model card Files Files and versions Community

cgus commited on Feb 2, 2024

Commit

bd41493

·

verified ·

1 Parent(s): 2b88331

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -30,7 +30,7 @@ Honestly, I have no idea how YARN models supposed to work out of the box.
 Llama.cpp should have proper YARN support but I haven't seen good enough information about real life application of this feature.
 Text-Generation-WebUI doesn't set loading params for YARN models automatically even though the params are defined in config.json of the model.
 Maybe some other apps do it properly but you might need to do it yourself as the model seems to output gibberish if scaling params aren't set properly.
-The model supposedly has 8k context with 16x RoPE scaling.
 I tried to load it with 8x scaling and potential 64k and it did seem to output text properly but 8x is max I could set in Text-Generation-WebUI.
 Didn't test it thoroughly with different scaling and context lengths, so can't promise anything.

 Llama.cpp should have proper YARN support but I haven't seen good enough information about real life application of this feature.
 Text-Generation-WebUI doesn't set loading params for YARN models automatically even though the params are defined in config.json of the model.
 Maybe some other apps do it properly but you might need to do it yourself as the model seems to output gibberish if scaling params aren't set properly.
+The model supposedly has 8k context with 16x RoPE scaling.
 I tried to load it with 8x scaling and potential 64k and it did seem to output text properly but 8x is max I could set in Text-Generation-WebUI.
 Didn't test it thoroughly with different scaling and context lengths, so can't promise anything.