Update README.md
README.md CHANGED
```diff
@@ -29,11 +29,10 @@ Created by: [upstage](https://huggingface.co/upstage)
 Honestly, I have no idea how YARN models are supposed to work out of the box.
 Llama.cpp should have proper YARN support, but I haven't seen good enough information about real-life use of this feature.
 Text-Generation-WebUI doesn't set loading params for YARN models automatically, even though the params are defined in the model's config.json.
-Maybe some other apps do it properly but you might need to do it yourself as the model
+Maybe some other apps do it properly, but you might need to set the params yourself, as the model seems to output gibberish if they aren't set correctly.
 The model supposedly has 8k context with 16x RoPE scaling.
 I tried loading it with 8x scaling for a potential 64k context, and it did seem to output text properly, but 8x is the max I could set in Text-Generation-WebUI.
-Didn't test it thoroughly with different scaling and context lengths, can't promise anything
+Didn't test it thoroughly with different scaling and context lengths, so I can't promise anything.
-As a side note, the GGUF conversion process is so blazingly fast compared to exl2, I'm impressed...
 
 ## How to run
 
```
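Since the change says you may need to set the scaling params yourself, a minimal sketch of doing that with llama.cpp's CLI might look like the following. The flag names are llama.cpp's own (`--rope-scaling`, `--yarn-orig-ctx`, `--ctx-size`); the 8192 base context and 8x factor are the README's figures, and the model filename is a placeholder:

```shell
# Hypothetical invocation: load the GGUF with YaRN scaling set explicitly
# instead of relying on the app to read it from config.json.
# Target context = 8192 (base) * 8 (scale factor) = 65536.
./llama-cli -m ./model-q4_k_m.gguf \
  --rope-scaling yarn \
  --yarn-orig-ctx 8192 \
  --ctx-size 65536 \
  -p "Hello"
```

If the output turns to gibberish at long contexts, the scaling params are the first thing to re-check.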