Llama.cpp support for Qwen2ForCausalRM?

#1
by twoxfh - opened

Are there any plans to contribute support for the Qwen2ForCausalRM architecture to Llama.cpp? I would really like to try this model out, but given its space requirements, that is not feasible without quantization. I appreciate your input, and thank you for reading.
