inference-optimization/ctest-Qwen3.5-9B-sliding-window-speculator.dflash 2B • Updated about 5 hours ago
inference-optimization/DFlash-SWA-Causal-Qwen3-8B-Magpie-Ultrachat 2B • Updated about 16 hours ago • 62
inference-optimization/Laguna-XS.2-speculator.dflash-Qwen235B-ckpt6 0.6B • Updated about 20 hours ago • 103
inference-optimization/Ministral-3-14B-Instruct-2512-NVFP4 Text Generation • Updated 8 days ago • 330 • 1
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w4a16 Text Generation • 32B • Updated 14 days ago • 229
inference-optimization/Qwen3-235B-A22B-Thinking-2507-quantized.w8a8 Text Generation • 235B • Updated 14 days ago • 227
inference-optimization/Qwen3-235B-A22B-Instruct-2507-quantized.w4a16 Text Generation • 32B • Updated 14 days ago • 211
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-noise Image-Text-to-Text • 32B • Updated 15 days ago • 134
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-hybrid Image-Text-to-Text • 32B • Updated 15 days ago • 130
inference-optimization/Qwen3.6-35B-A3B-7.0-bits-mode-heuristic Image-Text-to-Text • 32B • Updated 15 days ago • 158
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-noise Image-Text-to-Text • 30B • Updated 15 days ago • 132
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-hybrid Image-Text-to-Text • 30B • Updated 15 days ago • 117
inference-optimization/Qwen3.6-35B-A3B-6.5-bits-mode-heuristic Image-Text-to-Text • 30B • Updated 15 days ago • 110
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-noise Image-Text-to-Text • 28B • Updated 15 days ago • 116
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-hybrid Image-Text-to-Text • 28B • Updated 15 days ago • 292
inference-optimization/Qwen3.6-35B-A3B-6.0-bits-mode-heuristic Image-Text-to-Text • 28B • Updated 15 days ago • 121
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-noise Image-Text-to-Text • 26B • Updated 15 days ago • 123
inference-optimization/Qwen3.6-35B-A3B-5.5-bits-mode-hybrid Image-Text-to-Text • 26B • Updated 15 days ago • 127