view article Article We Got Claude to Build CUDA Kernels and teach open models! +2 13 days ago β’ 128
Running 3.67k The Ultra-Scale Playbook π 3.67k The ultimate guide to training LLM on large GPU Clusters