Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Misc
Reset Misc
int8
Inference Endpoints
AutoTrain Compatible
Eval Results
text-generation-inference
8-bit precision
text-embeddings-inference
custom_code
4-bit precision
Misc with no match
Merge
Carbon Emissions
Mixture of Experts
Apply filters
Models
249
Full-text search
Edit filters
Sort: Trending
Active filters:
int8
Clear all
OpenNMT/Llama-2-7b-hf-ct2-int8
Text Generation
•
Updated
Dec 1, 2023
•
5
OpenNMT/Llama-2-7b-chat-hf-ct2-int8
Text Generation
•
Updated
Dec 1, 2023
•
4
minuva/MiniLMv2-goemotions-v2-onnx
Text Classification
•
Updated
Apr 24, 2024
•
2
•
2
avans06/ALMA-7B-ct2-int8_float16
Text Generation
•
Updated
Dec 15, 2023
•
12
avans06/ALMA-13B-ct2-int8_float16
Text Generation
•
Updated
Dec 15, 2023
•
5
minuva/MiniLMv2-toxic-jigsaw-lite-onnx
Text Classification
•
Updated
Apr 24, 2024
•
2
•
1
minuva/MiniLMv2-toxic-jigsaw-onnx
Text Classification
•
Updated
Apr 24, 2024
•
578
•
2
avans06/madlad400-7b-mt-bt-ct2-int8_float16
Updated
Dec 24, 2023
•
26
•
2
Intel/table-transformer-int8-static-inc
Updated
Dec 27, 2023
•
3
ecastera/eva-mistral-dolphin-7b-spanish
Text Generation
•
Updated
Mar 16, 2024
•
113
•
12
minuva/MiniLMv2-userflow-v2-onnx
Text Classification
•
Updated
Apr 24, 2024
•
6
•
1
minuva/MiniLMv2-agentflow-v2-onnx
Text Classification
•
Updated
Apr 24, 2024
•
554
•
2
ecastera/ecastera-eva-westlake-7b-spanish
Text Generation
•
Updated
Mar 16, 2024
•
24
•
2
jvh/whisper-base-quant-ct2
Automatic Speech Recognition
•
Updated
Mar 19, 2024
•
5
•
2
jvh/whisper-large-v2-quant-ct2
Automatic Speech Recognition
•
Updated
Mar 19, 2024
•
11
•
3
jvh/whisper-medium-quant-ct2
Automatic Speech Recognition
•
Updated
Mar 19, 2024
•
14
•
2
jvh/whisper-large-v3-quant-ct2
Automatic Speech Recognition
•
Updated
Mar 19, 2024
•
17
•
1
study-hjt/Meta-Llama-3-8B-Instruct-AWQ
Text Generation
•
Updated
Apr 23, 2024
•
79
•
1
study-hjt/Meta-Llama-3-8B-Instruct-GPTQ-Int8
Text Generation
•
Updated
Apr 23, 2024
•
17
•
2
study-hjt/Meta-Llama-3-70B-Instruct-GPTQ-Int8
Text Generation
•
Updated
Apr 23, 2024
•
8
•
2
avans06/Meta-Llama-3-8B-Instruct-ct2-int8_float16
Text Generation
•
Updated
Apr 25, 2024
•
19
nitsuai/ct2fast-all-MiniLM-L6-v2
Sentence Similarity
•
Updated
Apr 26, 2024
•
3
nitsuai/ct2fast-paraphrase-multilingual-MiniLM-L12-v2
Sentence Similarity
•
Updated
Apr 26, 2024
•
5
study-hjt/Qwen1.5-110B-Chat-GPTQ-Int8
Text Generation
•
Updated
Apr 27, 2024
•
6
study-hjt/Qwen1.5-32B-Chat-GPTQ-Int8
Text Generation
•
Updated
Apr 26, 2024
•
9
•
1
study-hjt/CodeQwen1.5-7B-Chat-GPTQ-Int8
Text Generation
•
Updated
Apr 26, 2024
•
78
•
1
Weblet/Llama-2-7b-chat-hf-ct2-int8
Text Generation
•
Updated
Apr 30, 2024
•
1
ecastera/eva-dolphin-llama3-8b-spanish
Text Generation
•
Updated
Jun 14, 2024
•
30
•
4
Anthonyg5005/L3-8B-Stheno-v3.1-int8-ct2
Text Generation
•
Updated
Jun 17, 2024
•
2
Anthonyg5005/turbcat-instruct-8b-int8-ct2
Text Generation
•
Updated
Jun 20, 2024
•
4
Previous
1
...
5
6
7
8
9
Next