ggml_llava-v1.5-7b

This repo contains GGUF files to inference llava-v1.5-7b with llama.cpp end-to-end without any extra dependency.

Note: The mmproj-model-f16.gguf file structure is experimental and may change. Always use the latest code in llama.cpp.

Downloads last month: 1,690

GGUF

Model size

6.74B params

Architecture

llama

4-bit

5-bit

16-bit

Inference Providers NEW

This model is not currently available via any of the supported third-party Inference Providers, and HF Inference API was unable to determine this model's library.

mys
/

ggml_llava-v1.5-7b

ggml_llava-v1.5-7b

Spaces using mys/ggml_llava-v1.5-7b 3