"Usage on Windows does not work out of the box because the repository tries to use flash attention"
Flash Attention 2 on Windows requires CUDA 12.2 or higher. Other than that, it works fine.
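If it helps, here is a rough sketch of how one might check the CUDA 12.2 requirement before trying flash attention. The helper name `cuda_at_least` is made up for illustration and is not part of the repository or of any library; it only compares a version string against the 12.2 minimum mentioned above.

```python
def cuda_at_least(version, minimum=(12, 2)):
    """Return True if a CUDA version string such as "12.4" is >= minimum.

    Hypothetical helper, not part of any library. `version` may be None
    (for example on a CPU-only build), which counts as not supported.
    """
    if not version:
        return False
    parts = version.split(".")
    try:
        major = int(parts[0])
        # Treat a bare major version like "12" as "12.0".
        minor = int(parts[1]) if len(parts) > 1 else 0
    except ValueError:
        # Unparseable strings are treated as not meeting the requirement.
        return False
    # Tuple comparison orders by major version first, then minor.
    return (major, minor) >= minimum
```

With PyTorch installed, `torch.version.cuda` holds the CUDA version the build was compiled against (or `None` on CPU-only builds), so `cuda_at_least(torch.version.cuda)` is one possible way to decide whether flash attention is worth attempting on Windows.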