What Is MiMo Code? / 什么是 MiMo Code?
On June 11, 2026, Xiaomi's MiMo team open-sourced MiMo Code V0.1.0, a terminal-native AI coding agent built as a fork of the OpenCode project. It ships under an MIT license — no strings attached. You start it by typing mimo in a terminal; the team recommends iTerm2 or VS Code's built-in terminal on macOS.
The release caps a wild week for Xiaomi's AI division. Three days earlier, on June 8, MiMo and TileRT demonstrated 1,000+ tokens per second decode on a 1-trillion-parameter MoE model running on a single 8-GPU commodity node — roughly 10x faster than the standard MiMo-V2.5-Pro. That same model family, from MIT-licensed MiMo-V2.5 and V2.5-Pro to Apache-2.0 MiMo-V2-Flash, now underpins a first-party coding agent CLI. Most Chinese AI labs ship models. Xiaomi just shipped the tooling too.
Why It Matters / 为什么重要
MiMo Code is Xiaomi's direct answer to Anthropic's Claude Code, and it enters a CLI coding agent market that barely existed 18 months ago. The differentiators are concrete:
- Voice input, for real. MiMo Code integrates MiMo-V2.5-ASR speech recognition. You can orally correct a typo or say "send" / "execute" to control the terminal. Claude Code has no equivalent.
- Memory architecture that doesn't rot. Instead of jamming everything into a single growing context window, MiMo Code runs a dedicated subagent that maintains three layers — project memory, session checkpoints, and task progress. When the context window fills, the subagent rebuilds a clean, compact summary. The philosophy, quoted from the IT之家 release: "主 agent 专心干活,记录完全外包" — the main agent focuses on work; recording is fully outsourced.
- The /dream command. An independent agent wakes every 7 days to merge, deduplicate, verify path validity, and compress scattered memories into a single compact current state. It's essentially automatic context hygiene.
Key Features / 核心功能
MiMo Harness + Compose Mode / MiMo Harness 与 Compose 模式
The MiMo Harness is a framework purpose-built for the MiMo model family. Hit Tab to enter Compose mode, which walks you through an end-to-end pipeline: design → planning → coding → testing → review. It's structured enough to prevent chaos, flexible enough to not feel like a wizard form.
Persistent Three-Layer Memory / 三层持久记忆
- Project memory — what the agent knows about your codebase across sessions
- Session checkpoints — snapshots within a single coding session
- Task progress — where you are in the current task, so resumption is instant
The independent recording subagent means the main agent never context-switches to manage its own notes.
Voice Input & Control / 语音输入与控制
Powered by MiMo-V2.5-ASR. Correct typos orally. Say "execute" instead of hitting Enter. For developers who think out loud or prefer staying in flow, this is genuinely useful — not a gimmick.
Model Flexibility / 模型灵活性
MiMo Code ships with MiMo-V2.5 built in (native omnimodal: text, image, video, audio), but you're not locked in. Through the MiMo Token Plan, you can swap in DeepSeek, Kimi, GLM, and other third-party models. The Token Plan also works across Claude Code, OpenClaw, OpenCode, Kilo Code, Cline, Hermes Agent, and CodeBuddy Code — so credits aren't siloed to one tool.
The Broader MiMo Trifecta / MiMo 六月三连击
MiMo Code didn't land in a vacuum. Xiaomi's June 2026 can be read as a three-move sequence:
- June 8 — UltraSpeed. MiMo-V2.5-Pro-UltraSpeed hits >1,000 tokens/s on a 1T-parameter MoE model using commodity GPUs. The secret sauce: FP4 quantization (MXFP4, MoE experts only) + DFlash speculative decoding (block-level masked parallel prediction, 6.30 acceptance length in coding tasks) + TileRT runtime. Three layers of codesign. 3x the price for ~10x the speed.
- June 8–11 — Open-source model family. MiMo-V2.5 and V2.5-Pro (both MIT-licensed, 1M context window), plus MiMo-V2-Flash (Apache-2.0, on GitHub). Per InfoWorld, these are explicitly designed for "long-running AI agents" — the use case MiMo Code then embodies.
- June 11 — MiMo Code V0.1.0. The CLI agent that ties the models to actual developer workflows.
How It Compares / 对比分析
| Feature | MiMo Code V0.1.0 | Claude Code |
|---|---|---|
| License | MIT | Proprietary |
| Base | OpenCode fork | Anthropic first-party |
| Voice input | Yes (MiMo-V2.5-ASR) | No |
| Memory model | 3-layer + independent subagent + /dream | Single context window |
| Default model | MiMo-V2.5 (omnimodal) | Claude (text/image) |
| Third-party models | DeepSeek, Kimi, GLM, etc. | No |
MiMo Code's memory architecture is the most interesting design decision here. Offloading context management to a subagent means the main agent stays focused on code, and context compression happens proactively (every 7 days via /dream) rather than reactively when the window overflows.
Who Should Use It / 谁适合使用
- Developers already in the Xiaomi/MiMo ecosystem — native integration with MiMo-V2.5 models, Token Plan credits work across tools
- Mandarin-speaking developers — voice input trained on MiMo-V2.5-ASR, Chinese-language docs and community
- Anyone who wants an MIT-licensed Claude Code alternative — fork it, modify it, ship it
- Long-session coders — the persistent memory system is built for multi-hour coding sessions where context collapse is the usual failure mode
The Catch / 注意事项
V0.1.0 means exactly what it says. This is a first public release. Expect rough edges, missing integrations, and the kind of bugs that only surface when real developers hammer on a tool. The OpenCode foundation gives it a running start, but the gap between "promising CLI agent" and "daily driver" is measured in months of iteration. Whether Xiaomi sustains the open-source momentum — or treats this as a one-time PR move — will determine if MiMo Code matters a year from now.
Explore 40+ AI tools on TokenJoy.ai
Real reviews, pricing, and comparisons — updated weekly.
Browse AI Tools →