Llama.cpp support for Qwen2ForCausalRM?
#1 by twoxfh - opened
Are there any plans to contribute support for the Qwen2ForCausalRM architecture to Llama.cpp? I would really like to try this model out, but given its space requirements, that's not feasible without quantization. I appreciate your input, and thank you for reading.