Providers

Xiaomi MiMo

Xiaomi MiMo is the API platform for MiMo models. OpenClaw includes a bundled xiaomi plugin that registers both an OpenAI-compatible chat provider and a speech (TTS) provider against the same XIAOMI_API_KEY.

Property Value
Provider id xiaomi
Plugin bundled, enabledByDefault: true
Auth env var XIAOMI_API_KEY
Onboarding flag --auth-choice xiaomi-api-key
Direct CLI flag --xiaomi-api-key <key>
Contracts chat completions + speechProviders
API OpenAI-compatible (openai-completions)
Base URL https://api.xiaomimimo.com/v1
Default model xiaomi/mimo-v2-flash
TTS default mimo-v2.5-tts, voice mimo_default

Getting started

  • Get an API key

    Create an API key in the Xiaomi MiMo console.

  • Run onboarding

    openclaw onboard --auth-choice xiaomi-api-key
    

    Or pass the key directly:

    openclaw onboard --auth-choice xiaomi-api-key --xiaomi-api-key "$XIAOMI_API_KEY"
    
  • Verify the model is available

    openclaw models list --provider xiaomi
    
  • Built-in catalog

    Model ref Input Context Max output Reasoning Notes
    xiaomi/mimo-v2-flash text 262,144 8,192 No Default model
    xiaomi/mimo-v2-pro text 1,048,576 32,000 Yes Large context
    xiaomi/mimo-v2-omni text, image 262,144 32,000 Yes Multimodal

    Text-to-speech

    The bundled xiaomi plugin also registers Xiaomi MiMo as a speech provider for messages.tts. It calls Xiaomi's chat-completions TTS contract with the text as an assistant message and optional style guidance as a user message.

    Property Value
    TTS id xiaomi (mimo alias)
    Auth XIAOMI_API_KEY
    API POST /v1/chat/completions with audio
    Default mimo-v2.5-tts, voice mimo_default
    Output MP3 by default; WAV when configured
    {
      messages: {
        tts: {
          auto: "always",
          provider: "xiaomi",
          providers: {
            xiaomi: {
              apiKey: "xiaomi_api_key",
              model: "mimo-v2.5-tts",
              voice: "mimo_default",
              format: "mp3",
              style: "Bright, natural, conversational tone.",
            },
          },
        },
      },
    }
    

    Supported built-in voices include mimo_default, default_zh, default_en, Mia, Chloe, Milo, and Dean. mimo-v2-tts is supported for older MiMo TTS accounts; the default uses the current MiMo-V2.5 TTS model. For voice-note targets such as Feishu and Telegram, OpenClaw transcodes Xiaomi output to 48kHz Opus with ffmpeg before delivery.

    Config example

    {
      env: { XIAOMI_API_KEY: "your-key" },
      agents: { defaults: { model: { primary: "xiaomi/mimo-v2-flash" } } },
      models: {
        mode: "merge",
        providers: {
          xiaomi: {
            baseUrl: "https://api.xiaomimimo.com/v1",
            api: "openai-completions",
            apiKey: "XIAOMI_API_KEY",
            models: [
              {
                id: "mimo-v2-flash",
                name: "Xiaomi MiMo V2 Flash",
                reasoning: false,
                input: ["text"],
                cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
                contextWindow: 262144,
                maxTokens: 8192,
              },
              {
                id: "mimo-v2-pro",
                name: "Xiaomi MiMo V2 Pro",
                reasoning: true,
                input: ["text"],
                cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
                contextWindow: 1048576,
                maxTokens: 32000,
              },
              {
                id: "mimo-v2-omni",
                name: "Xiaomi MiMo V2 Omni",
                reasoning: true,
                input: ["text", "image"],
                cost: { input: 0, output: 0, cacheRead: 0, cacheWrite: 0 },
                contextWindow: 262144,
                maxTokens: 32000,
              },
            ],
          },
        },
      },
    }
    
    Auto-injection behavior

    The xiaomi provider is injected automatically when XIAOMI_API_KEY is set in your environment or an auth profile exists. You do not need to manually configure the provider unless you want to override model metadata or the base URL.

    Model details
    • mimo-v2-flash — lightweight and fast, ideal for general-purpose text tasks. No reasoning support.
    • mimo-v2-pro — supports reasoning with a 1M token context window for long-document workloads.
    • mimo-v2-omni — reasoning-enabled multimodal model that accepts both text and image inputs.
    Troubleshooting
    • If models do not appear, confirm XIAOMI_API_KEY is set and valid.
    • When the Gateway runs as a daemon, ensure the key is available to that process (for example in ~/.openclaw/.env or via env.shellEnv).