MiMo V2 Flash
Free · Code · code · llm · open source
Xiaomi's fast open-source LLM with 309B total parameters, of which 15B are active per token. It combines a Mixture-of-Experts architecture with Hybrid Sliding Window Attention and Multi-Token Prediction, supports a 256K-token context window, and reaches 150+ tokens per second at inference. Well suited to reasoning, coding, and agentic workflows.
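As a rough illustration of what the figures in the description imply, the sketch below works out the share of parameters active per token and an idealized generation time. The constants are taken from the description; the function names are hypothetical, and the timing ignores prompt processing, batching, and network latency, so treat it as back-of-the-envelope arithmetic rather than a benchmark.

```python
# Figures quoted in the description above.
TOTAL_PARAMS_B = 309      # total parameters, in billions
ACTIVE_PARAMS_B = 15      # parameters active per token, in billions
TOKENS_PER_SECOND = 150   # quoted lower bound on inference speed


def active_fraction(total_b: float, active_b: float) -> float:
    """Share of the model's parameters used for any single token."""
    return active_b / total_b


def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Idealized decode time: output tokens divided by throughput."""
    return num_tokens / tokens_per_second


if __name__ == "__main__":
    frac = active_fraction(TOTAL_PARAMS_B, ACTIVE_PARAMS_B)
    secs = generation_seconds(1000, TOKENS_PER_SECOND)
    print(f"Active share per token: {frac:.1%}")
    print(f"1,000 output tokens at {TOKENS_PER_SECOND} tok/s: {secs:.1f} s")
```

Under these assumptions, only about one parameter in twenty takes part in each token, which is the usual reason a Mixture-of-Experts model can be large in total yet quick at inference.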