deepseek-ai/DeepSeek-V3.2-Speciale
Text Generation • 685B • Updated • 22.5k • 651
Conditional Memory via Scalable Lookup: A New Axis of Sparsity for Large Language Models
mHC: Manifold-Constrained Hyper-Connections