GGUF quantization error

#1
by Doctor-Chad-PhD - opened

I'm getting this error when trying to quantize this model to gguf with llama.cpp:

AssertionError: HunYuan dynamic RoPE scaling assumptions changed, please update the logic or context length manually

Is there any way to fix this?

Thank you

It's odd, since the Chimera model can be converted to GGUF without this error.

Tencent org

We have updated "max_position_embeddings" in config.json. Could you please try again?
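For anyone hitting the same assertion locally, the equivalent workaround is to set "max_position_embeddings" in the model's config.json yourself before running llama.cpp's conversion script. Here is a minimal sketch; the function name and the value `262144` are placeholders (use the context length your model card actually specifies), and the demo runs against a throwaway config rather than a real checkpoint.

```python
import json
import tempfile
from pathlib import Path

def set_max_position_embeddings(config_path, value):
    """Overwrite "max_position_embeddings" in a Hugging Face config.json."""
    path = Path(config_path)
    config = json.loads(path.read_text())
    config["max_position_embeddings"] = value  # placeholder context length
    path.write_text(json.dumps(config, indent=2))

# Demo against a throwaway config.json, not a real model directory:
with tempfile.TemporaryDirectory() as tmp:
    cfg = Path(tmp) / "config.json"
    cfg.write_text(json.dumps({"max_position_embeddings": 32768}))
    set_max_position_embeddings(cfg, 262144)
    print(json.loads(cfg.read_text())["max_position_embeddings"])  # → 262144
```

After the edit, rerun the conversion; if the assertion still fires, the value likely needs to match what the RoPE-scaling logic in convert_hf_to_gguf.py expects for this architecture.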


Yes, it can be converted to GGUF and quantized with the change. Thank you.
(attached screenshot: sampleworking.png)
