File size: 443 Bytes
2922088
37b8ea9
 
 
 
 
 
 
cfa3591
 
 
 
2922088
 
cfa3591
49b5be2
cfa3591
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
---
title: Llama Hqq 1 Bit
emoji: πŸ“Š
colorFrom: green
colorTo: pink
sdk: gradio
sdk_version: 4.24.0
app_file: app.py
license: llama2
train: false
inference: false
pipeline_tag: text-generation
---

Demo for  HQQ 1-bit quantized (binary weights) Llama2-7B-chat model using a low-rank adapter to improve the performance (referred to as HQQ+).
You will need a GPU for this.

https://huggingface.co/mobiuslabsgmbh/Llama-2-7b-chat-hf_1bitgs8_hqq