Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Hyperbolic
Together AI
Replicate
Fireworks
Novita
Cerebras
SambaNova
Nebius AI Studio
fal
HF Inference API
Misc
Reset Misc
VPTQ
Misc with no match
Inference Endpoints
AutoTrain Compatible
text-generation-inference
Eval Results
Merge
4-bit precision
8-bit precision
custom_code
text-embeddings-inference
Carbon Emissions
Mixture of Experts
Apply filters
Models
70
Full-text search
Edit filters
Sort: Trending
Active filters:
VPTQ
Clear all
VPTQ-community/deepseek-r1_v_8_k_65536_mixed_mp4
Updated
2 days ago
•
7
•
2
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v12-k65536-4096-woft
Updated
Jan 13
•
244
•
3
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v16-k65536-32768-woft
Updated
16 days ago
•
31
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-65536-woft
Updated
Nov 18, 2024
•
49
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-4096-woft
Updated
Nov 18, 2024
•
32
VPTQ-community/Meta-Llama-3.1-8B-Instruct-v8-k65536-256-woft
Updated
Nov 18, 2024
•
130
VPTQ-community/Qwen2.5-72B-Instruct-v16-k65536-65536-woft
Updated
16 days ago
•
29
•
4
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v16-k65536-65536-woft
Updated
16 days ago
•
30
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-256-woft
Updated
16 days ago
•
53
•
1
VPTQ-community/Qwen2.5-7B-Instruct-v8-k65536-256-woft
Updated
Nov 18, 2024
•
32
VPTQ-community/Qwen2.5-72B-Instruct-v16-k65536-32768-woft
Updated
16 days ago
•
30
•
3
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k32768-0-woft
Updated
16 days ago
•
46
•
1
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-65536-woft
Updated
16 days ago
•
35
•
2
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k16384-0-woft
Updated
16 days ago
•
27
•
2
VPTQ-community/Meta-Llama-3.1-70B-Instruct-v8-k65536-0-woft
Updated
16 days ago
•
38
•
2
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-4-woft-duplicated
Updated
16 days ago
•
27
•
1
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-1024-woft
Updated
16 days ago
•
21
•
1
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v8-k4096-0-woft
Updated
16 days ago
•
23
•
1
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-64-woft
Updated
16 days ago
•
25
•
3
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k32768-32768-woft
Updated
15 days ago
•
32
•
1
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-128-woft
Updated
16 days ago
•
17
•
1
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-4-woft
Updated
16 days ago
•
44
•
2
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-0-woft
Updated
16 days ago
•
44
•
2
VPTQ-community/Qwen2.5-72B-Instruct-v8-k512-512-woft
Updated
16 days ago
•
37
•
1
VPTQ-community/Qwen2.5-72B-Instruct-v8-k1024-512-woft
Updated
16 days ago
•
31
•
2
VPTQ-community/Meta-Llama-3.1-405B-Instruct-v16-k65536-256-woft
Updated
16 days ago
•
33
•
1
VPTQ-community/Qwen2.5-72B-Instruct-v8-k65536-256-woft
Updated
16 days ago
•
47
•
4
VPTQ-community/Qwen2.5-14B-Instruct-v8-k256-256-woft
Updated
Nov 18, 2024
•
36
VPTQ-community/Qwen2.5-14B-Instruct-v16-k65536-65536-woft
Updated
Nov 18, 2024
•
19
VPTQ-community/Qwen2.5-14B-Instruct-v8-k65536-256-woft
Updated
Nov 18, 2024
•
44
Previous
1
2
3
Next