Reddit r/LocalLLaMA
Xiaomi launches MiMo V2 Pro reasoning model

💡 Xiaomi's 1M-context reasoning model launches at $1/M tokens: test it against GPT rivals
⚡ 30-Second TL;DR
What Changed
An upgrade from MiMo-V2-Flash (Intelligence Index score: 41)
Why It Matters
Offers competitive long-context reasoning via affordable API, pressuring rivals like OpenAI for enterprise use.
What To Do Next
Test MiMo-V2-Pro via Xiaomi API for 1M context reasoning benchmarks.
Who should care: Developers & AI engineers
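The "what to do next" step can be sketched as a request against an OpenAI-style chat-completions endpoint. This is a minimal sketch under assumptions: the base URL, model identifier, and request shape below are hypothetical placeholders, not documented values from Xiaomi's API.

```python
import json

# Hypothetical values -- Xiaomi has not published these in the cited sources.
BASE_URL = "https://api.example-xiaomi-mimo.com/v1/chat/completions"
MODEL = "mimo-v2-pro"

def build_request(prompt: str, max_tokens: int = 1024) -> dict:
    """Assemble an OpenAI-style chat-completion payload for a
    long-context reasoning test."""
    return {
        "model": MODEL,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": max_tokens,
    }

payload = build_request("Summarize the attached long report and list open risks.")
print(json.dumps(payload)[:40])
```

For a real benchmark you would POST this payload (with an API key header) to the provider's actual endpoint and measure latency and answer quality against a GPT-class baseline on the same prompts.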
🧠 Deep Insight
Web-grounded analysis with 7 cited sources.
📝 Enhanced Key Takeaways
- MiMo-V2-Flash was offered for free via API until January 20, 2026, to encourage developer adoption.[2]
- MiMo-V2-Flash achieves output speeds of approximately 150 tokens per second, positioning it competitively in speed benchmarks.[2]
- A technical report for MiMo-V2-Flash was published on arXiv in January 2026, detailing its development.[7]
🛠️ Technical Deep Dive
- MiMo-V2-Flash uses a Mixture-of-Experts (MoE) architecture with 309B total parameters and 15B active parameters at inference time.[6][7]
- At FP16 (2 bytes per weight), the 309B parameters alone occupy roughly 618 GB of VRAM before KV-cache and activation overhead; 8-bit quantization halves that to about 309 GB.[2]
- Designed specifically for high-speed reasoning tasks, with no image input support.[3][6]
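The memory figures above follow from simple arithmetic on parameter count and weight precision. A minimal sketch, ignoring KV cache and activations (which add real overhead at 1M context):

```python
def weight_vram_gb(total_params: float, bytes_per_param: float) -> float:
    """VRAM needed just to hold the weights, in gigabytes (1 GB = 1e9 bytes)."""
    return total_params * bytes_per_param / 1e9

# 309B total parameters. In an MoE model all experts must reside in memory,
# even though only ~15B parameters are active per forward pass.
TOTAL_PARAMS = 309e9

print(weight_vram_gb(TOTAL_PARAMS, 2.0))  # FP16 (2 bytes/param): 618.0 GB
print(weight_vram_gb(TOTAL_PARAMS, 1.0))  # INT8/FP8 (1 byte/param): 309.0 GB
```

Note that sparse activation (15B of 309B) reduces compute per token, not weight storage, which is why the full 309B figure drives the VRAM requirement.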
🔮 Future Implications
AI analysis grounded in cited sources
MiMo-V2-Pro will pressure competitors to lower pricing for large-context models
Its $1/$3 per 1M tokens pricing at full 1M context undercuts the typical rates frontier models charge for comparable capabilities.
API-only access limits MiMo-V2-Pro's ecosystem growth compared to open-weight rivals
Lack of open weights prevents fine-tuning and local deployment, restricting adoption to API-dependent applications.
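Assuming the quoted "$1/$3 per 1M tokens" means $1 per million input tokens and $3 per million output tokens (the conventional reading, not explicitly confirmed here), the cost of a near-full-context request works out as:

```python
INPUT_PRICE = 1.00   # USD per 1M input tokens (quoted)
OUTPUT_PRICE = 3.00  # USD per 1M output tokens (quoted)

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost in USD for one API call at the quoted rates."""
    return (input_tokens * INPUT_PRICE + output_tokens * OUTPUT_PRICE) / 1_000_000

# e.g. a 900k-token prompt producing a 4k-token answer
print(round(request_cost(900_000, 4_000), 3))  # 0.912
```

At these rates a request that nearly fills the 1M-token window costs on the order of a dollar, which is the pricing pressure the analysis above refers to.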
⏳ Timeline
2026-01
MiMo-V2-Flash technical report published on arXiv
2026-01
MiMo-V2-Flash API launched with free access until Jan 20
2026-02
MiMo-V2-Flash released with 309B/15B MoE architecture
2026-03
MiMo-V2-Pro launched as reasoning model upgrade
📚 Sources (7)
Factual claims are grounded in the sources below. Forward-looking analysis is AI-generated interpretation.
AI-curated news aggregator. All content rights belong to original publishers.
Original source: Reddit r/LocalLLaMA