Edit Models filters

Inference Providers

HF Inference API

Misc

4bit-quantization

Inference Endpoints

text-generation-inference

4-bit precision

8-bit precision

text-embeddings-inference

Mixture of Experts

Carbon Emissions

Models

5

Full-text search

Active filters: 4bit-quantization

boods/mistral_location_extractor_4bit_0.1

Text Generation • 4B • Updated May 9 • 5

retro56/gemma3-4b-bengali-multimodal-persona

Text Generation • Updated May 31

AnnasShaikh/TinyLlama-1.1B-Chat-Roast

Antez02k/Murmur_v1

Text Generation • Updated about 18 hours ago • 29

Jackrong/gpt-oss-20b-MLX-4bit

Text Generation • 21B • Updated 22 days ago • 186