RedHatAI/QwQ-32B-Preview-quantized.w4a16
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w8a8 • Text Generation • 71B • 12
RedHatAI/Llama-3.1-Nemotron-70B-Instruct-HF-quantized.w4a16 • Text Generation • 11B • 7
RedHatAI/Mixtral-8x22B-v0.1-quantized.w4a16
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic • Text Generation • 8B • 7 • 1
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic • Text Generation • 8B • 7
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic • Text Generation • 8B • 9 • 1
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-quantized.w4a16 • Text Generation • 2B • 8
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16 • Text Generation • 2B • 13 • 3
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-quantized.w4a16 • Text Generation • 2B • 9
RedHatAI/Qwen2.5-3B-quantized.w4a16 • Text Generation • 1.0B • 13
RedHatAI/Qwen2.5-1.5B-quantized.w4a16 • Text Generation • 0.6B • 8
RedHatAI/Qwen2.5-0.5B-quantized.w4a16 • Text Generation • 0.3B • 10
RedHatAI/Qwen2.5-14B-Instruct-quantized.w8a8 • Text Generation • 15B • 111
RedHatAI/granite-3.1-8b-instruct-GGUF • 8B • 12
RedHatAI/Sparse-Llama-3.1-8B-2of4 • Text Generation • 8B • 108 • 62
RedHatAI/Qwen2.5-Math-7B-Instruct-FP8-dynamic • 8B • 7
RedHatAI/Qwen2.5-0.5B-Instruct-quantized.w8a8 • Text Generation • 0.6B • 876
RedHatAI/Qwen2.5-72B-FP8-dynamic • Text Generation • 73B • 566 • 1
RedHatAI/Qwen2.5-72B-quantized.w8a8 • Text Generation • 73B • 11
RedHatAI/Qwen2.5-14B-quantized.w8a8 • Text Generation • 15B • 23 • 2
RedHatAI/Qwen2.5-14B-FP8-dynamic • Text Generation • 15B • 8 • 2
RedHatAI/Qwen2.5-7B-quantized.w8a8 • Text Generation • 8B • 21 • 1
RedHatAI/Qwen2.5-3B-FP8-dynamic • Text Generation • 3B • 11
RedHatAI/Qwen2.5-1.5B-FP8-dynamic • Text Generation • 2B • 697
RedHatAI/Qwen2.5-0.5B-FP8-dynamic • Text Generation • 0.6B • 8
RedHatAI/Qwen2.5-3B-quantized.w8a8 • Text Generation • 3B • 12 • 1
RedHatAI/Qwen2.5-1.5B-quantized.w8a8 • Text Generation • 2B • 55.8k • 1
RedHatAI/Qwen2.5-0.5B-quantized.w8a8 • Text Generation • 0.6B • 54
RedHatAI/Meta-Llama-3.1-405B-Instruct-quantized.w8a8 • Text Generation • 406B • 42 • 1