Created using the fork pwilkin/llama.cpp, commit 8f64302. The main llama.cpp repo now supports these models! The quantization process is unchanged, so the existing files do not need to be re-made.
This is still in development; expect issues.
Settings
- temp: 1.1
- top-p: 0.95
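
As a minimal sketch of how the settings above might be passed to llama.cpp, the helper below builds a `llama-cli` command line with `--temp` and `--top-p`. The helper name `build_llama_cli_args`, the model path, and the prompt are placeholders introduced here for illustration and are not part of this repository; substitute the quant file you actually downloaded.

```python
import shlex

# Sampling settings listed on this model card.
TEMP = 1.1
TOP_P = 0.95


def build_llama_cli_args(model_path: str, prompt: str) -> list[str]:
    """Return an argv list for llama.cpp's llama-cli using the card's settings.

    Hypothetical helper: it only assembles the arguments and does not run anything.
    """
    return [
        "llama-cli",
        "-m", model_path,        # path to the GGUF file
        "--temp", str(TEMP),     # sampling temperature
        "--top-p", str(TOP_P),   # nucleus sampling threshold
        "-p", prompt,            # initial prompt
    ]


# Placeholder path and prompt, assumed for the example only.
args = build_llama_cli_args("path/to/model.gguf", "Hello")

# Print a shell-quoted command that can be copied into a terminal.
print(shlex.join(args))
```

To run the command directly from Python instead of printing it, the same list can be handed to `subprocess.run(args)`, assuming `llama-cli` is on your `PATH`.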
The IQ models are made using bartowski1182/calibration_datav3.txt as the calibration data.
Base model: ByteDance-Seed/Seed-OSS-36B-Instruct