Running 1 Sparsely gated tiny linear experts ๐ฅ A compute-efficient and interpretable transformer FFN layer