ggml_llava-v1.5-7b

This repo contains GGUF files to inference llava-v1.5-7b with llama.cpp end-to-end without any extra dependency.

Note: The mmproj-model-f16.gguf file structure is experimental and may change. Always use the latest code in llama.cpp.

Downloads last month
1,690
GGUF
Model size
6.74B params
Architecture
llama

4-bit

5-bit

16-bit

Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

Spaces using mys/ggml_llava-v1.5-7b 3