view article Article Ulysses Sequence Parallelism: Training with Million-Token Contexts 20 days ago • 24
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 19 days ago • 79
Towards Scalable Pre-training of Visual Tokenizers for Generation Paper • 2512.13687 • Published Dec 15, 2025 • 106
view article Article Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek Jan 27 • 45
view article Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective Jan 27 • 67
view article Article Make your ZeroGPU Spaces go brrr with ahead-of-time compilation +2 Sep 2, 2025 • 77
view article Article From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels Aug 18, 2025 • 95
MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention Paper • 2506.13585 • Published Jun 16, 2025 • 274