Active filters: arm64
vlad-m-dev/distiluse-base-multilingual-v2-merged-onnx
Feature Extraction • Updated • 1

onnx-community/distiluse-base-multilingual-v2-merged-onnx
Feature Extraction • Updated • 1

halley-ai/gpt-oss-20b-MLX-4bit-gs32
Text Generation • 21B • Updated • 40 • 1

halley-ai/gpt-oss-20b-MLX-6bit-gs32
Text Generation • 21B • Updated • 27 • 1

halley-ai/gpt-oss-20b-MLX-5bit-gs32
Text Generation • 21B • Updated • 24 • 1

halley-ai/gpt-oss-120b-MLX-8bit-gs32
Text Generation • 117B • Updated • 68 • 1

halley-ai/gpt-oss-120b-MLX-bf16
Text Generation • 117B • Updated • 133 • 2

halley-ai/gpt-oss-120b-MLX-6bit-gs64
Text Generation • 117B • Updated • 50 • 1

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-4bit-gs64
Text Generation • 80B • Updated • 20 • 1

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-5bit-gs32
Text Generation • 80B • Updated • 19 • 1

halley-ai/Qwen3-Next-80B-A3B-Instruct-MLX-6bit-gs64
Text Generation • 80B • Updated • 17 • 1

mjbommar/glaurung-binary-tokenizer-001
Feature Extraction • Updated

mjbommar/glaurung-binary-tokenizer-002
Feature Extraction • Updated • 1

Hellohal2064/vllm-dgx-spark-gb10
Text Generation • Updated