Providers
Xiaomi MiMo
Xiaomi MiMo is the API platform for MiMo models. OpenClaw includes a bundled xiaomi plugin that registers both an OpenAI-compatible chat provider and a speech (TTS) provider against the same XIAOMI_API_KEY.
| Property | Value |
|---|---|
| Provider id | xiaomi |
| Plugin | bundled, enabledByDefault: true |
| Auth env var | XIAOMI_API_KEY |
| Onboarding flag | --auth-choice xiaomi-api-key |
| Direct CLI flag | --xiaomi-api-key <key> |
| Contracts | chat completions + speechProviders |
| API | OpenAI-compatible (openai-completions) |
| Base URL | https://api.xiaomimimo.com/v1 |
| Default model | xiaomi/mimo-v2-flash |
| TTS default | mimo-v2.5-tts, voice mimo_default |
Getting started
Get an API key
Create an API key in the Xiaomi MiMo console.
Run onboarding
openclaw onboard --auth-choice xiaomi-api-key
Or pass the key directly:
openclaw onboard --auth-choice xiaomi-api-key --xiaomi-api-key "$XIAOMI_API_KEY"
Verify the model is available
openclaw models list --provider xiaomi
Built-in catalog
| Model ref | Input | Context | Max output | Reasoning | Notes |
|---|---|---|---|---|---|
xiaomi/mimo-v2-flash |
text | 262,144 | 8,192 | No | Default model |
xiaomi/mimo-v2-pro |
text | 1,048,576 | 32,000 | Yes | Large context |
xiaomi/mimo-v2-omni |
text, image | 262,144 | 32,000 | Yes | Multimodal |
Text-to-speech
The bundled xiaomi plugin also registers Xiaomi MiMo as a speech provider for
messages.tts. It calls Xiaomi's chat-completions TTS contract with the text as
an assistant message and optional style guidance as a user message.
| Property | Value |
|---|---|
| TTS id | xiaomi (mimo alias) |
| Auth | XIAOMI_API_KEY |
| API | POST /v1/chat/completions with audio |
| Default | mimo-v2.5-tts, voice mimo_default |
| Output | MP3 by default; WAV when configured |
{
messages: {
tts: {
auto: "always",
provider: "xiaomi",
providers: {
xiaomi: {
apiKey: "xiaomi_api_key",
model: "mimo-v2.5-tts",
voice: "mimo_default",
format: "mp3",
style: "Bright, natural, conversational tone.",
},
},
},
},
}
Supported built-in voices include mimo_default, default_zh, default_en,
Mia, Chloe, Milo, and Dean. mimo-v2-tts is supported for older MiMo
TTS accounts; the default uses the current MiMo-V2.5 TTS model. For voice-note
targets such as Feishu and Telegram, OpenClaw transcodes Xiaomi output to 48kHz
Opus with ffmpeg before delivery.
Config example
{
env: { XIAOMI_API_KEY: "your-key" },
agents: { defaults: { model: { primary: "xiaomi/mimo-v2-flash" } } },
models: {
mode: "merge",
providers: {
xiaomi: {
baseUrl: "https://api.xiaomimimo.com/v1",
api: "openai-completions",
apiKey: "XIAOMI_API_KEY",
models: [
{
id: "mimo-v2-flash",
name: "Xiaomi MiMo V2 Flash",
reasoning: false,
input: ["text"],
cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
contextWindow: 262144,
maxTokens: 8192,
},
{
id: "mimo-v2-pro",
name: "Xiaomi MiMo V2 Pro",
reasoning: true,
input: ["text"],
cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
contextWindow: 1048576,
maxTokens: 32000,
},
{
id: "mimo-v2-omni",
name: "Xiaomi MiMo V2 Omni",
reasoning: true,
input: ["text", "image"],
cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
contextWindow: 262144,
maxTokens: 32000,
},
],
},
},
},
}
Auto-injection behavior
The xiaomi provider is injected automatically when XIAOMI_API_KEY is set in your environment or an auth profile exists. You do not need to manually configure the provider unless you want to override model metadata or the base URL.
Model details
- mimo-v2-flash — lightweight and fast, ideal for general-purpose text tasks. No reasoning support.
- mimo-v2-pro — supports reasoning with a 1M token context window for long-document workloads.
- mimo-v2-omni — reasoning-enabled multimodal model that accepts both text and image inputs.
Troubleshooting
- If models do not appear, confirm
XIAOMI_API_KEYis set and valid. - When the Gateway runs as a daemon, ensure the key is available to that process (for example in
~/.openclaw/.envor viaenv.shellEnv).