Active filters: gptq
TheBloke/Wizard-Vicuna-13B-Uncensored-GPTQ • Text Generation • 2B • Updated • 336 • 320
TheBloke/vicuna-7B-v1.5-GPTQ • Text Generation • 1B • Updated • 79 • 16
TheBloke/Phind-CodeLlama-34B-v2-GPTQ • Text Generation • 5B • Updated • 28 • 90
Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int4 • Image-Text-to-Text • 1B • Updated • 722 • 27
Qwen/Qwen2.5-32B-Instruct-GPTQ-Int4 • Text Generation • 6B • Updated • 37.4k • 39
Qwen/Qwen2.5-72B-Instruct-GPTQ-Int4 • Text Generation • 12B • Updated • 23.7k • 40
Qwen/Qwen2.5-Coder-32B-Instruct-GPTQ-Int4 • Text Generation • 6B • Updated • 34.3k • 21
Qwen/Qwen3-235B-A22B-GPTQ-Int4 • Text Generation • Updated • 75.5k • 25
QuantTrio/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8 • Text Generation • 8B • Updated • 191k • 9
xiangxinai/Xiangxin-Guardrails-Text • Text Generation • 3B • Updated • 106 • 10
QuantTrio/KAT-Dev-GPTQ-Int8 • Text Generation • 33B • Updated • 12 • 1
QuantTrio/Kimi-Dev-72B-GPTQ-Int4 • Text Generation • 73B • Updated • 114 • 2
QuantTrio/Kimi-Dev-72B-GPTQ-Int8 • Text Generation • 73B • Updated • 28 • 2
MidnightPhreaker/GLM-4.5-Air-REAP-82B-A12B-GPTQ-INT4-gs32 • 14B • Updated • 206 • 3
ModelCloud/GLM-4.6-REAP-268B-A32B-GPTQMODEL-W4A16 • Text Generation • 269B • Updated • 35 • 1
ModelCloud/MiniMax-M2-GPTQMODEL-W4A16 • Text Generation • 229B • Updated • 67 • 1
ModelCloud/Marin-32B-Base-GPTQMODEL-W4A16 • Text Generation • 33B • Updated • 9 • 1
elinas/alpaca-13b-lora-int4 • Text Generation • Updated • 2 • 41
elinas/alpaca-30b-lora-int4 • Text Generation • Updated • 2 • 68
mayaeary/pygmalion-6b-4bit-128g • Text Generation • Updated • 17 • 40
mayaeary/pygmalion-6b_dev-4bit-128g • Text Generation • Updated • 7 • 120
mayaeary/PPO_Pygway-V8p4_Dev-6b-4bit-128g • Text Generation • Updated • 2 • 2
mayaeary/PPO_Pygway-6b-Mix-4bit-128g • Text Generation • Updated • 2 • 2
elinas/vicuna-13b-4bit • Text Generation • Updated • 6 • 45
TheBloke/koala-7B-GPTQ • Text Generation • 1B • Updated • 10 • 31
TheBloke/koala-7B-HF • Text Generation • Updated • 1.07k • 21
TheBloke/koala-13B-HF • Text Generation • Updated • 755 • 41
TheBloke/koala-13B-GPTQ • Text Generation • 2B • Updated • 12 • 38
TheBloke/galpaca-30B-GPTQ • Text Generation • Updated • 3 • 48
Ancestral/Dolly_Shygmalion-6b-4bit-128g • Text Generation • Updated • 1 • 5