Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries • 10 days ago • 66
Distil Efficiency Benchmarks Collection Collection of models used in the blog post www.distillabs.ai/blog/the-10x-inference-tax-you-dont-have-to-pay • 9 items • Updated 18 days ago • 3
Quantized Qwen3.5 Collection Verified models. Compatible with Transformers v5.3 and vLLM v0.16.1rc1 (nightly). Under evaluation. • 9 items • Updated 7 days ago • 9
huihui-ai/Huihui-Qwen3.5-35B-A3B-abliterated Image-Text-to-Text • 36B • Updated 18 days ago • 50.2k • 225
Qwen3-MoE Collection Compressed Qwen3 MoE models with a reduced number of experts. See additional models at https://huggingface.co/bknyaz. • 9 items • Updated Feb 11 • 3
Cerebras REAP Collection Sparse MoE models compressed using the REAP (Router-weighted Expert Activation Pruning) method • 30 items • Updated 23 days ago • 132