Edit Models filters

Apps

Docker Model Runner

Inference Providers

OVHcloud AI Endpoints

HF Inference API

Misc

Inference Endpoints

text-generation-inference

Eval Results (legacy)

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

629

Full-text search

Active filters: quantization

Octen/Octen-Embedding-8B-INT8

Sentence Similarity • 8B • Updated about 6 hours ago • 26 • 3

legraphista/DeepSeek-Coder-V2-Instruct-IMat-GGUF

Text Generation • 236B • Updated Jun 19, 2024 • 610 • 6

HyperX-Sentience/SDXL-GGUF

Text-to-Image • 3B • Updated Jun 24, 2025 • 410 • 12

stabilityai/stable-diffusion-3.5-large-tensorrt

Text-to-Image • Updated Oct 20, 2025 • 554 • 51

ArtusDev/requests-exl

Updated Oct 13, 2025 • 6

dougeeai/llama-cpp-python-wheels

Updated Nov 9, 2025 • 3

EricRollei/HunyuanImage-3-NF4-ComfyUI

Text-to-Image • 83B • Updated Nov 24, 2025 • 34 • 2

avtc/GLM-4.6-REAP-268B-A32B-GPTQMODEL-W4A16

Text Generation • 271B • Updated Dec 18, 2025 • 103 • 3

coughmedicine/Huihui-Qwen3-Next-80B-A3B-Instruct-abliterated-nvfp4

Updated Dec 13, 2025 • 51 • 1

drbaph/Qwen-Image-Edit-2511-FP8

Image-to-Image • Updated 26 days ago • 4.4k • 6

goniz/MiniMax-M2.1-REAP-30-GGUF

162B • Updated 5 days ago • 1.79k • 1

ryukin164/LFM2.5-1.2B-Q4-JP

Text Generation • 1B • Updated 2 days ago • 279 • 1

ethzanalytics/gpt-j-6B-8bit-sharded

Text Generation • 6B • Updated Jan 10, 2025 • 6 • 7

ethzanalytics/gpt-j-8bit-daily_dialogues

Text Generation • 6B • Updated Dec 25, 2024 • 10 • 4

ethzanalytics/gpt-j-8bit-KILT_WoW_10k_steps

Text Generation • Updated Nov 27, 2022 • 13

leumastai/t5-large-quantized

Updated Mar 16, 2023 • 3 • 1

pszemraj/stablelm-7b-sft-v7e3-autogptq-4bit-128g

Text Generation • Updated 21 days ago • 8 • 3

limcheekin/flan-t5-small-ct2

Updated May 24, 2023 • 4

limcheekin/flan-t5-xl-ct2

Updated Jun 3, 2023 • 8 • 1

limcheekin/flan-t5-xxl-ct2

Updated May 30, 2023 • 2 • 1

limcheekin/fastchat-t5-3b-ct2

Text Generation • Updated Jun 28, 2023 • 2 • 2

limcheekin/flan-alpaca-gpt4-xl-ct2

Updated Jun 4, 2023 • 2

limcheekin/mpt-7b-storywriter-ct2

Updated Jun 27, 2023

limcheekin/falcon-7b-instruct-ct2

Updated Jun 19, 2023 • 2 • 1

limcheekin/mpt-7b-instruct-ct2

Updated Jun 19, 2023 • 2

limcheekin/redpajama-chat-7b-ct2

Updated Jun 9, 2023 • 4

seonglae/wizardlm-7b-uncensored-gptq

Text Generation • Updated Jul 19, 2023 • 13

seonglae/llama-2-7b-chat-hf-gptq

Text Generation • Updated Jul 20, 2023 • 6

seonglae/llama-2-13b-chat-hf-gptq

Text Generation • Updated Jul 20, 2023 • 6

clibrain/Llama-2-7b-ft-instruct-es-gptq-4bit

Text Generation • Updated Sep 1, 2023 • 10 • 9