lars1234
/

Mistral-Small-24B-Instruct-2501-writer-AWQ

4-bit precision

Model card Files Files and versions Community

lars1234 commited on 5 days ago

Commit

37aee63

·

verified ·

1 Parent(s): 3ce053e

Create README.md

Files changed (1) hide show

README.md +18 -0

README.md ADDED Viewed

	@@ -0,0 +1,18 @@

+---
+license: apache-2.0
+datasets:
+- lars1234/story_writing_benchmark
+base_model:
+- lars1234/Mistral-Small-24B-Instruct-2501-writer
+---
+# Mistral-Small-24B-Instruct-2501-writer-AWQ
+This model is the 4-bit AWQ-quantized version of [Mistral-Small-24B-Instruct-2501-writer](lars1234/Mistral-Small-24B-Instruct-2501-writer).
+- **Quantization Method**: AWQ (Activation-aware Weight Quantization)
+- **Quantization Configuration**:
+  - Bit Width: 4-bit
+  - Group Size: 128
+  - Zero Point: Enabled
+  - Version: GEMM