jayzou3773/qwen3-moe-expert_drop-pure_gradient_pruning-r64-s1k-128samples-thinking 16B • Updated 43 minutes ago
jayzou3773/qwen3-moe-expert_drop-pure_expert_gradient_pruning-r64-s1k-128samples-thinking 16B • Updated about 1 hour ago
jayzou3773/qwen3-moe-expert_drop-layerwise_pruning-r64-s1k-128samples-thinking 16B • Updated about 1 hour ago
jayzou3773/qwen3-moe-expert_drop-bias_pruning-r64-s1k-128samples-thinking 16B • Updated about 1 hour ago
jayzou3773/qwen3-moe-neuron_structure_drop-p50-s1k-128samples-thinking 16B • Updated about 1 hour ago
jayzou3773/qwen3_5-moe-expert_drop-weight_magnitude_pruning-r128-s1k-128samples 19B • Updated 3 days ago • 145
jayzou3773/qwen3_5-moe-expert_drop-pure_gradient_pruning-r128-s1k-128samples 19B • Updated 3 days ago • 84
jayzou3773/qwen3_5-moe-expert_drop-pure_expert_gradient_pruning-r128-s1k-128samples 19B • Updated 3 days ago • 83
jayzou3773/qwen3_5-moe-expert_drop-layerwise_pruning-r128-s1k-128samples 19B • Updated 3 days ago • 83
jayzou3773/qwen3-moe-expert_drop-weight_magnitude_pruning-r64-s1k-128samples 16B • Updated 3 days ago • 55
jayzou3773/qwen3-moe-expert_drop-pure_gradient_pruning-r64-s1k-128samples 16B • Updated 3 days ago • 52
jayzou3773/qwen3-moe-expert_drop-pure_expert_gradient_pruning-r64-s1k-128samples 16B • Updated 3 days ago • 59