Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

text-embeddings-inference

4-bit precision

8-bit precision

Mixture of Experts

Carbon Emissions

Models

7,324

Full-text search

Active filters: gptq

TheBloke/Wizard-Vicuna-30B-Uncensored-GPTQ

Text Generation • 33B • Updated Sep 27, 2023 • 109k • 604

Qwen/Qwen3-30B-A3B-GPTQ-Int4

Text Generation • 31B • Updated May 21, 2025 • 291k • 47

Qwen/Qwen-VL-Chat-Int4

Text Generation • 10B • Updated Jan 25, 2024 • 2.07k • 94

TheBloke/Falcon-180B-GPTQ

Text Generation • 179B • Updated Sep 27, 2023 • 34 • 9

Qwen/Qwen1.5-0.5B-Chat-GPTQ-Int4

Text Generation • 0.5B • Updated Apr 30, 2024 • 1.64k • 14

Qwen/Qwen2.5-14B-Instruct-GPTQ-Int4

Text Generation • 15B • Updated Oct 9, 2024 • 92.2k • 25

Qwen/Qwen2.5-Coder-7B-Instruct-GPTQ-Int4

Text Generation • 8B • Updated Nov 18, 2024 • 800k • 13

Qwen/Qwen2.5-Coder-14B-Instruct-GPTQ-Int8

Text Generation • 15B • Updated Jan 12, 2025 • 8.28k • 6

ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v1

Text Generation • 8B • Updated Jan 24, 2025 • 9 • 6

ModelCloud/DeepSeek-R1-Distill-Qwen-7B-gptqmodel-4bit-vortex-v2

Text Generation • 8B • Updated Jan 24, 2025 • 853 • 8

ModelCloud/QwQ-32B-gptqmodel-4bit-vortex-v1

Text Generation • 33B • Updated Mar 9, 2025 • 14 • 12

JunHowie/Qwen3-32B-GPTQ-Int8

Text Generation • 33B • Updated Sep 5, 2025 • 1.31k • 4

AngelSlim/Qwen3-14B_int4_gptq

15B • Updated Jul 10, 2025 • 8 • 1

QuantTrio/Qwen3-Coder-30B-A3B-Instruct-GPTQ-Int8

Text Generation • 31B • Updated Sep 5, 2025 • 4.24k • 8

JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int4

Text Generation • 4B • Updated Sep 4, 2025 • 4.71k • 2

openguardrails/OpenGuardrails-Text-2510

Text Generation • 15B • Updated 24 days ago • 13.5k • 7

ModelCloud/Granite-4.0-H-1B-GPTQMODEL-W4A16

Text Generation • 1B • Updated Oct 31, 2025 • 6 • 1

ModelCloud/Granite-4.0-H-350M-GPTQMODEL-W4A16

Text Generation • 0.3B • Updated Oct 31, 2025 • 24 • 1

ModelCloud/Brumby-14B-Base-GPTQMODEL-W4A16

Text Generation • 15B • Updated Oct 31, 2025 • 5 • 1

ModelCloud/Brumby-14B-Base-GPTQMODEL-W4A16-v2

Text Generation • 15B • Updated Oct 31, 2025 • 5 • 1

avtc/GLM-4.5-Air-GPTQMODEL-W8A16

Text Generation • 116B • Updated Dec 1, 2025 • 8 • 2

SEOKDONG/gpt-oss-safeguard-20b-kor-enterprise-gptq-4bit

Text Generation • 21B • Updated Dec 2, 2025 • 96 • 2

ModelCloud/Qwen3-Coder-30B-A3B-Instruct-GPTQMODEL-W4A16-A

Text Generation • 31B • Updated Dec 25, 2025 • 25 • 1

ModelCloud/Qwen3-Coder-30B-A3B-Instruct-GPTQMODEL-W4A16-B

Text Generation • 31B • Updated Dec 25, 2025 • 8 • 1

plezan/MiniMax-M2.1-REAP-50-W4A16

Text Generation • Updated Jan 14 • 1.37k • 6

Mohaaxa/qwen2.5-1.5b-gptq-4bit-v2

Text Generation • 2B • Updated 8 days ago • 27 • 1

elinas/alpaca-13b-lora-int4

Text Generation • Updated Apr 5, 2023 • 16 • 41

elinas/alpaca-30b-lora-int4

Text Generation • Updated Apr 5, 2023 • 5 • 68

mayaeary/pygmalion-6b-4bit-128g

Text Generation • Updated Mar 28, 2023 • 6 • 40

mayaeary/pygmalion-6b_dev-4bit-128g

Text Generation • Updated Mar 28, 2023 • 12 • 121