Zephyr 7B Beta Llamafiles

See here for a guide on how to use llamafiles!

Both the server and the CLI are based on TheBloke's Zephyr 7B Beta GGUF Q4_K_M model.

Usage

NOTE: The executable is larger than 4GB, which exceeds the maximum executable size Windows supports, so it does not currently run on Windows. I will add a Windows-friendly version of Zephyr 7B Beta when I can.

# downloads the server build; substitute the CLI llamafile's URL if you prefer the CLI
wget https://huggingface.co/TimeSurgeLabs/zephyr-7b-beta-llamafile/resolve/main/zephyr-beta-server.llamafile
chmod +x zephyr-beta-server.llamafile
./zephyr-beta-server.llamafile