SonicMoE: Accelerating MoE with IO and Tile-aware Optimizations
Paper
ā¢
2512.14080
ā¢
Published
ā¢
6
Large scale distributed AI model training, model parallelisation, low-level GPU acceleration, make GPUs go brrrrr