Spaces:
Runtime error
Runtime error
File size: 443 Bytes
2922088 37b8ea9 cfa3591 2922088 cfa3591 49b5be2 cfa3591 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 |
---
title: Llama Hqq 1 Bit
emoji: π
colorFrom: green
colorTo: pink
sdk: gradio
sdk_version: 4.24.0
app_file: app.py
license: llama2
train: false
inference: false
pipeline_tag: text-generation
---
Demo for HQQ 1-bit quantized (binary weights) Llama2-7B-chat model using a low-rank adapter to improve the performance (referred to as HQQ+).
You will need a GPU for this.
https://huggingface.co/mobiuslabsgmbh/Llama-2-7b-chat-hf_1bitgs8_hqq
|