Models (30)

Active filters: speculative-decoding

jukofyork/Qwen3-0.6B-YaRN-GGUF

0.8B • Updated 25 days ago • 293 • 3

mradermacher/DeepSeek-R1-DRAFT-0.5B-v1.0-GGUF

0.5B • Updated Jul 11 • 86

mradermacher/DeepSeek-V3-0324-DRAFT-0.5B-v1.0-GGUF

0.5B • Updated Jul 11 • 63

Gapeleon/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.0-Q4_K_M-GGUF

0.6B • Updated Jun 10 • 8

Goldenwert/multitoken-gpt2-metamathqa

Text Generation • Updated Jun 10 • 13

mradermacher/DeepSeek-V3-0324-CODER-DRAFT-0.6B-v1.0-GGUF

0.6B • Updated Jul 11 • 249

mradermacher/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.0-GGUF

0.6B • Updated Jul 11 • 144

corupta/dseek-draft-test

0.8B • Updated Jun 13 • 6

nm-testing/eagle-llama3.1-8b-instruct

0.3B • Updated Jul 9 • 2

nm-testing/hass-llama3.1-8b-layernorms

0.3B • Updated Jul 9 • 3

mradermacher/DeepSeek-R1-0528-CODER-DRAFT-0.6B-v1.1-GGUF

0.6B • Updated Jul 11 • 146

mradermacher/DeepSeek-V3-0324-CODER-DRAFT-0.6B-v1.1-GGUF

0.6B • Updated Jul 11 • 375

mradermacher/DeepSeek-R1-DRAFT-0.6B-v2.0-GGUF

0.6B • Updated Jul 20 • 119

mradermacher/DeepSeek-V3-DRAFT-0.6B-v2.0-GGUF

0.6B • Updated Jul 22 • 74

jukofyork/GLM-4.5-DRAFT-0.6B-v3.0

0.6B • Updated 27 days ago • 23 • 1

jukofyork/GLM-4.5-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated 27 days ago • 1.46k • 13

mradermacher/GLM-4.5-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated 27 days ago • 1.9k

mradermacher/GLM-4.5-DRAFT-0.6B-v3.0-i1-GGUF

0.6B • Updated 27 days ago • 1.05k

jukofyork/DeepSeek-R1-DRAFT-0.6B-v3.0

0.6B • Updated 25 days ago • 17

jukofyork/DeepSeek-R1-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated 27 days ago • 451

mradermacher/DeepSeek-R1-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated 26 days ago • 1.17k

mradermacher/DeepSeek-R1-DRAFT-0.6B-v3.0-i1-GGUF

0.6B • Updated 26 days ago • 2.26k

jukofyork/DeepSeek-V3-DRAFT-0.6B-v3.0

0.6B • Updated 25 days ago • 15

jukofyork/DeepSeek-V3-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated 25 days ago • 136

jukofyork/Kimi-K2-Instruct-DRAFT-0.6B-v3.0

0.7B • Updated 25 days ago • 40

jukofyork/Kimi-K2-Instruct-DRAFT-0.6B-v3.0-GGUF

0.7B • Updated 25 days ago • 895

jukofyork/Qwen3-Coder-Instruct-DRAFT-0.75B-GGUF

0.8B • Updated 25 days ago • 661 • 3

mradermacher/DeepSeek-V3-DRAFT-0.6B-v3.0-GGUF

0.6B • Updated 24 days ago • 3.76k

mradermacher/DeepSeek-V3-DRAFT-0.6B-v3.0-i1-GGUF

0.6B • Updated 24 days ago • 4.15k

nm-testing/llama4-scout-17b-eagle3-dummy-drafter

Updated 8 days ago • 79