Here's a q5_1 quantized version of the WavTokenizer. I had to make a few tweaks to quantize it correctly, as some tensors have incompatible shapes, but it works correctly with the current version of llama.cpp and is somewhat smaller. Feel free to try it.
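
The tweaks themselves are not described in this comment. As a rough illustration of why a tensor shape can be incompatible with q5_1: ggml's Q5_1 format packs weights in blocks of 32 along a row, so a row length that is not a multiple of 32 cannot be stored that way and is typically left in a higher-precision type. The Python sketch below only checks that condition and estimates the packed size. The helper names, tensor names, and shapes are hypothetical and are not taken from the WavTokenizer file or from llama.cpp.

```python
# Sketch only: check whether a tensor's row length fits the Q5_1 block layout.
# Assumption: Q5_1 stores 32 weights per block in 24 bytes
# (fp16 scale + fp16 minimum + 4 bytes of high bits + 16 bytes of low nibbles).

Q5_1_BLOCK_SIZE = 32   # weights per block
Q5_1_BLOCK_BYTES = 24  # bytes per block


def can_quantize_q5_1(row_length: int) -> bool:
    """Return True if a row of this length splits evenly into Q5_1 blocks."""
    return row_length > 0 and row_length % Q5_1_BLOCK_SIZE == 0


def q5_1_nbytes(n_elements: int, row_length: int) -> int:
    """Estimate the packed size in bytes of a tensor stored as Q5_1."""
    if not can_quantize_q5_1(row_length):
        raise ValueError(f"row length {row_length} is not a multiple of {Q5_1_BLOCK_SIZE}")
    return n_elements // Q5_1_BLOCK_SIZE * Q5_1_BLOCK_BYTES


# Hypothetical tensors, given as (rows, row_length).
example_tensors = {
    "example.weight_a": (512, 768),  # 768 is a multiple of 32
    "example.weight_b": (768, 7),    # 7 is not, so it would stay unquantized
}

for name, (rows, row_length) in example_tensors.items():
    if can_quantize_q5_1(row_length):
        size = q5_1_nbytes(rows * row_length, row_length)
        print(f"{name}: q5_1, about {size} bytes")
    else:
        print(f"{name}: keep in original precision")
```

At 24 bytes per 32 weights, Q5_1 works out to 6 bits per weight, which is where the size reduction comes from.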

ggerganov changed pull request status to merged