cgus/HuatuoGPT-o1-7B-exl2



cgus committed on Feb 4

Commit ac0d189 · verified · 1 parent: c9308f0

Update README.md

Files changed (1)

README.md +17 -1


@@ -7,12 +7,28 @@ language:
 - en
 - zh
 base_model:
-- Qwen/Qwen2.5-7B-Instruct
+- FreedomIntelligence/HuatuoGPT-o1-7B
 pipeline_tag: text-generation
 tags:
 - medical
 ---
+# HuatuoGPT-o1-7B-exl2
+Original model: [HuatuoGPT-o1-7B](https://huggingface.co/FreedomIntelligence/HuatuoGPT-o1-7B) made by [FreedomIntelligence](https://huggingface.co/FreedomIntelligence)
+Based on: [Qwen2.5-7B-Instruct](https://huggingface.co/Qwen/Qwen2.5-7B-Instruct) by [Qwen](https://huggingface.co/Qwen)
+## Quants
+[4bpw h6 (main)](https://huggingface.co/cgus/HuatuoGPT-o1-7B-exl2/tree/main)
+[4.5bpw h6](https://huggingface.co/cgus/HuatuoGPT-o1-7B-exl2/tree/4.5bpw-h6)
+[5bpw h6](https://huggingface.co/cgus/HuatuoGPT-o1-7B-exl2/tree/5bpw-h6)
+[6bpw h6](https://huggingface.co/cgus/HuatuoGPT-o1-7B-exl2/tree/6bpw-h6)
+[8bpw h8](https://huggingface.co/cgus/HuatuoGPT-o1-7B-exl2/tree/8bpw-h8)
+## Quantization notes
+Made with Exllamav2 0.2.7 with default dataset.
+Exl2 quants require Nvidia RTX on Windows or Nvidia RTX/AMD ROCm on Linux.
+Model has to fully fit GPU as RAM offloading isn't supported natively.
+It can be used with apps such as TabbyAPI, Text-Generation-WebUI, LoLLMs and others.
+# Original model card
 <div align="center">
 <h1>
   HuatuoGPT-o1-7B