kakaocorp/kanana-2-30b-a3b-thinking-2601 Text Generation • Updated about 1 month ago • 1.25k • 54
kakaocorp/kanana-2-30b-a3b-instruct-2601 Text Generation • 31B • Updated about 1 month ago • 735 • 50
kakaocorp/kanana-2-30b-a3b-mid-2601 Text Generation • 31B • Updated about 1 month ago • 104 • 30
kakaocorp/kanana-2-30b-a3b-thinking Text Generation • 31B • Updated about 1 month ago • 229 • 39
kakaocorp/kanana-2-30b-a3b-base Text Generation • 31B • Updated about 1 month ago • 1.1k • 28
Kanana: Compute-efficient Bilingual Language Models Paper • 2502.18934 • Published Feb 26, 2025 • 65
Running 3.69k The Ultra-Scale Playbook 🌌 3.69k The ultimate guide to training LLM on large GPU Clusters
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper • 2312.00752 • Published Dec 1, 2023 • 150
Rephrasing the Web: A Recipe for Compute and Data-Efficient Language Modeling Paper • 2401.16380 • Published Jan 29, 2024 • 51