Inference Optimization
community
AI & ML interests
None defined yet.
Recent Activity
View all activity
models 93
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.5bits
25B • Updated
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5bits
20B • Updated
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5.5bits
22B • Updated
inference-optimization/Qwen3-30B-A3B-Instruct-2507-7bits
27B • Updated
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.75bits
26B • Updated
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6.25bits
24B • Updated
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5.75bits
22B • Updated
inference-optimization/Qwen3-30B-A3B-Instruct-2507-5.25bits
21B • Updated
inference-optimization/Qwen3-30B-A3B-Instruct-2507-6bits
23B • Updated
• 8
inference-optimization/GLM-4.6-FP8-dynamic
Text Generation • 353B • Updated
• 1
datasets 0
None public yet