view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 Dec 1, 2025 • 296
Running on CPU Upgrade Featured 2.98k The Smol Training Playbook 📚 2.98k The secrets to building world-class LLMs
ericzhang0328/loopllama3.2-1b-deepspeed-0904-slimpajama-6B Text Generation • 1B • Updated Sep 14, 2025 • 1
ericzhang0328/llama3.2-1b-cpt-deepspeed-slimpajama-6B Text Generation • 1B • Updated Sep 14, 2025 • 1
ericzhang0328/loopllama3.2-1b-deepspeed-0904-slimpajama-6B Text Generation • 1B • Updated Sep 14, 2025 • 1
ericzhang0328/llama3.2-1b-cpt-deepspeed-slimpajama-6B Text Generation • 1B • Updated Sep 14, 2025 • 1
Efficient 3D Recognition with Event-driven Spike Sparse Convolution Paper • 2412.07360 • Published Dec 10, 2024 • 1