Anthonyg5005
/

turbcat-instruct-8b-int8-ct2

Text Generation

quantized model

Model card Files Files and versions Community

Anthonyg5005 commited on Jun 17, 2024

Commit

7dedd28

·

verified ·

1 Parent(s): c7079e4

temp readme

Files changed (1) hide show

README.md +29 -3

README.md CHANGED Viewed

@@ -1,3 +1,29 @@
----
-license: unknown
----

+---
+license: unknown
+language:
+- en
+- zh
+library_name: CTranslate2
+pipeline_tag: text-generation
+tags:
+- facebook
+- meta
+- llama
+- llama-3
+- kaltcit
+- ct2
+- quantized model
+- int8
+base_model: turboderp/unknown
+---
+# CTranslate2 int8 version of turbcat
+This is a int8_float16 quantization of [turbcat](not released yet)\
+See more on CTranslate2: [Docs](https://opennmt.net/CTranslate2/index.html) | [Github](https://github.com/OpenNMT/CTranslate2)
+This model was converted to ct2 format using the following commnd:
+```
+ct2-transformers-converter --model kat_turbcat --output_dir turbcat-ct2 --quantization int8_bfloat16 --low_cpu_mem_usage
+```
+***no converstion needed using the model from this repository as it is already in ct2 format.***