Active filters: vLLM
Image-Text-to-Text • 17B • Updated • 744 • 19

QuantTrio/Seed-OSS-36B-Instruct-AWQ
Text Generation • 36B • Updated • 385 • 7

QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation • 36B • Updated • 132 • 4

QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation • 36B • Updated • 28 • 5

QuantTrio/Seed-OSS-36B-Instruct-GPTQ-Int3
Text Generation • 34B • Updated • 8 • 3

amakhov/tiny-random-llama
Text Generation • 4.18M • Updated • 16

Text Generation • 41B • Updated • 4 • 2

QuantTrio/DeepSeek-V3.1-AWQ
Text Generation • 485B • Updated • 810 • 5

QuantTrio/DeepSeek-V3.1-AWQ-Fp16Mix
Text Generation • 286B • Updated • 5 • 1

QuantTrio/DeepSeek-V3.1-AWQ-Lite
Text Generation • 684B • Updated • 27 • 3

JunHowie/Qwen3-4B-Instruct-2507-GPTQ-Int8
Text Generation • 4B • Updated • 2.88k

JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int4
Text Generation • 4B • Updated • 259 • 1

JunHowie/Qwen3-4B-Thinking-2507-GPTQ-Int8
Text Generation • 4B • Updated • 243 • 2

JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int4
Text Generation • 31B • Updated • 7.4k

JunHowie/Qwen3-30B-A3B-Instruct-2507-GPTQ-Int8
Text Generation • 31B • Updated • 6

JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int4
Text Generation • 31B • Updated • 49

JunHowie/Qwen2-7B-Instruct-GPTQ-Int4
Text Generation • 8B • Updated • 8

JunHowie/Qwen2-7B-Instruct-GPTQ-Int8
Text Generation • 8B • Updated • 52

EliovpAI/Deepseek-R1-0528-Qwen3-8B-FP8-KV
Text Generation • 8B • Updated • 7

JunHowie/Qwen3-30B-A3B-Thinking-2507-GPTQ-Int8
Text Generation • 31B • Updated • 27

JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int4
Text Generation • 36B • Updated • 6

JunHowie/Seed-OSS-36B-Instruct-GPTQ-Int8
Text Generation • 36B • Updated • 4

QuantTrio/Qwen3-VL-235B-A22B-Instruct-AWQ
Text Generation • 236B • Updated • 823 • 11

QuantTrio/Qwen3-VL-235B-A22B-Instruct-FP8
Text Generation • Updated • 31

QuantTrio/Qwen3-VL-235B-A22B-Thinking-AWQ
Text Generation • 236B • Updated • 422 • 6

QuantTrio/Qwen3-VL-235B-A22B-Thinking-FP8
Text Generation • 236B • Updated • 93

QuantTrio/DeepSeek-V3.2-Exp-AWQ
Text Generation • 486B • Updated • 48 • 4

QuantTrio/DeepSeek-V3.2-Exp-AWQ-Lite
Text Generation • 685B • Updated • 77 • 4

Text Generation • 50B • Updated • 3.22k • 5

QuantTrio/GLM-4.6-GPTQ-Int4-Int8Mix
Text Generation • 69B • Updated • 212 • 4