RedHatAI/Mixtral-8x22B-v0.1-quantized.w4a16
18B
•
Updated
•
33
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-FP8-dynamic
Text Generation
•
8B
•
Updated
•
4
•
1
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-FP8-dynamic
Text Generation
•
8B
•
Updated
•
1
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-FP8-dynamic
Text Generation
•
8B
•
Updated
•
65
•
2
RedHatAI/Sparse-Llama-3.1-8B-gsm8k-2of4-quantized.w4a16
Text Generation
•
2B
•
Updated
•
5
RedHatAI/Sparse-Llama-3.1-8B-ultrachat_200k-2of4-quantized.w4a16
Text Generation
•
2B
•
Updated
•
4
•
3
RedHatAI/Sparse-Llama-3.1-8B-evolcodealpaca-2of4-quantized.w4a16
Text Generation
•
2B
•
Updated
•
2
RedHatAI/Qwen2.5-3B-quantized.w4a16
Text Generation
•
1.0B
•
Updated
•
618
RedHatAI/Qwen2.5-1.5B-quantized.w4a16
Text Generation
•
0.6B
•
Updated
•
1
RedHatAI/Qwen2.5-0.5B-quantized.w4a16
Text Generation
•
0.3B
•
Updated
•
3
RedHatAI/Qwen2.5-14B-Instruct-quantized.w8a8
Text Generation
•
15B
•
Updated
•
18
RedHatAI/granite-3.1-8b-instruct-GGUF
8B
•
Updated
RedHatAI/Sparse-Llama-3.1-8B-2of4
Text Generation
•
8B
•
Updated
•
35
•
62
RedHatAI/Qwen2.5-Math-7B-Instruct-FP8-dynamic
8B
•
Updated
•
1
RedHatAI/Qwen2.5-0.5B-Instruct-quantized.w8a8
Text Generation
•
0.6B
•
Updated
•
38
RedHatAI/Qwen2.5-72B-FP8-dynamic
Text Generation
•
73B
•
Updated
•
16
•
1
RedHatAI/Qwen2.5-72B-quantized.w8a8
Text Generation
•
73B
•
Updated
•
1
RedHatAI/Qwen2.5-14B-quantized.w8a8
Text Generation
•
15B
•
Updated
•
2
•
2
RedHatAI/Qwen2.5-14B-FP8-dynamic
Text Generation
•
15B
•
Updated
•
72
•
2
RedHatAI/Qwen2.5-7B-quantized.w8a8
Text Generation
•
8B
•
Updated
•
22
•
1
RedHatAI/Qwen2.5-3B-FP8-dynamic
Text Generation
•
3B
•
Updated
•
15
RedHatAI/Qwen2.5-1.5B-FP8-dynamic
Text Generation
•
2B
•
Updated
•
12
RedHatAI/Qwen2.5-0.5B-FP8-dynamic
Text Generation
•
0.6B
•
Updated
•
2
RedHatAI/Qwen2.5-3B-quantized.w8a8
Text Generation
•
3B
•
Updated
•
3
•
1
RedHatAI/Qwen2.5-1.5B-quantized.w8a8
Text Generation
•
2B
•
Updated
•
893k
•
2
RedHatAI/Qwen2.5-0.5B-quantized.w8a8
Text Generation
•
0.6B
•
Updated
•
357
RedHatAI/Meta-Llama-3.1-405B-Instruct-quantized.w8a8
Text Generation
•
406B
•
Updated
•
11
•
2
RedHatAI/Qwen2.5-14B-Instruct-FP8-dynamic
15B
•
Updated
•
10.6k
RedHatAI/Qwen2.5-72B-Instruct-FP8-dynamic
73B
•
Updated
•
237
•
1
RedHatAI/Qwen2.5-Coder-7B-FP8-dynamic
8B
•
Updated
•
13