Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
Fireworks
Replicate
Novita
Hyperbolic
SambaNova
Together AI
Nebius AI Studio
fal
HF Inference API
Misc
Reset Misc
llama-cpp
Inference Endpoints
Merge
text-generation-inference
Eval Results
4-bit precision
AutoTrain Compatible
8-bit precision
Mixture of Experts
Carbon Emissions
custom_code
text-embeddings-inference
Apply filters
Models
15,669
Full-text search
Edit filters
Sort: Trending
Active filters:
llama-cpp
Clear all
janw23/Phi-3-medium-4k-instruct-Q4_K_M-GGUF
Text Generation
•
Updated
Jun 27, 2024
•
6
UniOb/Meta-Llama-3-8B-Instruct-Q4_K_M-GGUF
Text Generation
•
Updated
Jun 28, 2024
•
7
HDiffusion/Llama-3-Instruct-8B-SPPO-Iter3-Q4_K_M-GGUF
Text Generation
•
Updated
Jun 28, 2024
•
20
Juanma12/Meta-Llama-3-8B-Instruct-Q4_K_M-GGUF
Text Generation
•
Updated
Jun 28, 2024
•
1
capricornstone/MING-1.8B-Q4_K_M-GGUF
Updated
Jun 28, 2024
•
38
capricornstone/MING-1.8B-Q8_0-GGUF
Updated
Jun 28, 2024
•
7
joshnader/Meta-Llama-3-8B-Instruct-Q4_K_M-GGUF
Text Generation
•
Updated
Jun 28, 2024
•
2
jeiku/Aura_Qwen2_v3_7B-Q4_K_M-GGUF
Updated
Jun 28, 2024
capricornstone/ChiMed-GPT-1.0-Q4_K_M-GGUF
Updated
Jun 28, 2024
saurabhraj115/Meta-Llama-3-8B-Q8_0-GGUF
Text Generation
•
Updated
Jun 28, 2024
PaoloRosa/distilgpt2-Q4_K_S-GGUF
Updated
Jun 28, 2024
•
10
yesquiteno/Phi-3-mini-4k-instruct-Q2_K-GGUF
Text Generation
•
Updated
Jun 28, 2024
•
2
Tech-Meld/llm-compiler-7b-Q4_K_M-GGUF
Updated
Jun 28, 2024
•
12
•
1
NikolayKozloff/oneirogen-7B-Q4_0-GGUF
Text Generation
•
Updated
Jun 28, 2024
•
5
•
1
NikolayKozloff/oneirogen-7B-Q5_0-GGUF
Text Generation
•
Updated
Jun 28, 2024
•
1
•
1
saurabhraj115/Meta-Llama-3-8B-Instruct-Q4_K_S-GGUF
Text Generation
•
Updated
Jun 28, 2024
•
1
bunnycore/SparrowMind-8B-Q5_K_M-GGUF
Updated
Jun 28, 2024
•
3
BLURPLETESTS/L3-15B-Stheno-v3.3-32K-exp-Q5_K_M-GGUF
Updated
Jan 14
•
22
BLURPLETESTS/LLaMa-3-Stheno-v3.2-15B-Q5_K_M-GGUF
Updated
Jun 28, 2024
•
12
Goekdeniz-Guelmez/J.O.S.I.E.v4o-8b-stage1-beta2.2-Q4_K_S-GGUF
Updated
Jun 28, 2024
•
8
•
1
gate369/Bitnet-Mistral.0.2-v6.8-Q8_0-GGUF
Updated
Jun 28, 2024
•
7
YorkieOH10/Qwen2-7B-Multilingual-RP-Q8_0-GGUF
Updated
Jun 28, 2024
•
30
•
2
Pekarnick/e5-large-v2-Q4_K_M-GGUF
Sentence Similarity
•
Updated
Jun 28, 2024
•
3
YorkieOH10/Qwen2-7B-Multilingual-RP-Q5_K_M-GGUF
Updated
Jun 28, 2024
•
32
YorkieOH10/Qwen2-7B-Multilingual-RP-Q4_K_M-GGUF
Updated
Jun 28, 2024
•
162
•
1
Leths/gpt2-Q4_K_M-GGUF
Updated
Jun 28, 2024
•
19
Kurgan1138/L3-8B-Instruct-Abliterated-DWP-Q5_K_M-GGUF
Updated
Jun 28, 2024
•
1
NikolayKozloff/L3-8B-Lunaris-v1-Q4_0-GGUF
Updated
Jun 28, 2024
•
17
•
1
NikolayKozloff/L3-8B-Lunaris-v1-Q5_0-GGUF
Updated
Jun 28, 2024
•
7
•
1
NikolayKozloff/L3-8B-Lunaris-v1-IQ4_NL-GGUF
Updated
Jun 28, 2024
•
36
•
1
Previous
1
...
97
98
99
100
Next