Collection of Quantized Models for MoE
Krishna Teja Chitty-Venkata
AI & ML interests
LLM Optimization, Neural Architecture Search, Quantization, Pruning
Recent Activity
updated
a model 24 minutes ago
inference-optimization/Qwen3.5-35B-A3B-FP8-Dynamic published
a model 28 minutes ago
inference-optimization/Qwen3.5-35B-A3B-FP8-Dynamic updated
a model 1 day ago
inference-optimization/gpt-oss-20b-FP8-Dynamic