arxiv:2407.09577
gräfics
graefics
·
AI & ML interests
None yet
Recent Activity
new activity about 15 hours ago
zai-org/GLM-5.1:[Feature request] Eliminate pre-attention RMSNorm in MLA-models via scale invariance + weight folding new activity about 15 hours ago
deepseek-ai/DeepSeek-R1:[Feature request] Eliminate pre-attention RMSNorm in MLA-models via scale invariance + weight folding new activity about 15 hours ago
moonshotai/Kimi-K2.6:[Feature request] Eliminate pre-attention RMSNorm in MLA-models via scale invariance + weight folding