MiMo V2 Flash

Free
Code, LLM, open source

Xiaomi's open-source LLM with 309B total parameters, of which 15B are active per token. It combines a Mixture-of-Experts architecture with Hybrid Sliding Window Attention and Multi-Token Prediction, supports a 256K-token context window, and reaches inference speeds above 150 tokens per second. Well suited to reasoning, coding, and agentic workflows.
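To put the quoted figures in perspective, here is a rough back-of-the-envelope sketch. It uses only the numbers from the description above. The helper names and the 2,000-token reply length are illustrative assumptions, and the timing ignores prompt processing and network latency.

```python
# Back-of-the-envelope figures derived from the numbers quoted in the listing.
TOTAL_PARAMS_B = 309     # total parameters, in billions (from the listing)
ACTIVE_PARAMS_B = 15     # parameters active per token, in billions (from the listing)
TOKENS_PER_SECOND = 150  # quoted lower bound on inference speed


def active_fraction(total_b: float, active_b: float) -> float:
    """Share of the model's parameters used for each token."""
    return active_b / total_b


def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Time to generate num_tokens at a steady rate (hypothetical helper)."""
    return num_tokens / tokens_per_second


if __name__ == "__main__":
    share = active_fraction(TOTAL_PARAMS_B, ACTIVE_PARAMS_B)
    print(f"active share per token: {share:.1%}")  # about 4.9%
    # 2,000 tokens is an assumed reply length, chosen only for illustration.
    print(f"2,000-token reply: {generation_seconds(2000, TOKENS_PER_SECOND):.1f} s")  # about 13.3 s
```

Roughly one parameter in twenty is active for any given token, which is how a Mixture-of-Experts model of this size can sustain the quoted speed.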

Alternatives & Related Tools