AI & ML interests
None defined yet.
Recent Activity
View all activity
Papers
MOPD: Multi-Teacher On-Policy Distillation for Capability Integration in LLM Post-Training
HySparse: A Hybrid Sparse Attention Architecture with Oracle Token Selection and KV Cache Sharing
Organization Card
Embark with pragmatic innovation.
Venture boldly into the unknown.
Challenge the AGI with deep thinking.
Ignite every curiosity with creative spark.
Ask Mi Anything!
models 25
XiaomiMiMo/MiMo-V2.5-DFlash
311B • Updated • 3
XiaomiMiMo/MiMo-Audio-Tokenizer
1B • Updated • 2.61k • 38
XiaomiMiMo/MiMo-Audio-7B-Instruct
Any-to-Any • 8B • Updated • 14.5k • 160
XiaomiMiMo/MiMo-Audio-7B-Base
Any-to-Any • 8B • Updated • 151 • 56
XiaomiMiMo/MiMo-V2.5-Pro-FP4-DFlash
Text Generation • 554B • Updated • 46.9k • 141
XiaomiMiMo/MiMo-V2.5-Base
311B • Updated • 217 • 30
XiaomiMiMo/MiMo-V2.5
311B • Updated • 215k • 343
XiaomiMiMo/MiMo-V2.5-Pro-Base
Text Generation • 1T • Updated • 226 • 40
XiaomiMiMo/MiMo-V2.5-Pro
Text Generation • 1T • Updated • 102k • • 690
XiaomiMiMo/MiMo-V2.5-ASR
Automatic Speech Recognition • 8B • Updated • 2.38k • 101