The Mystery 1T Model Everyone Thought Was DeepSeek V4 Turned Out to Be Xiaomi

An anonymous trillion-parameter model topped OpenRouter for 8 days. The entire AI community assumed it was DeepSeek V4. It was actually Xiaomi's MiMo-V2-Pro — and that matters more than the identity reveal.

xiaomimimodeepseekopenrouternews

On March 11, an anonymous model called “Hunter Alpha” appeared on OpenRouter with 1 trillion parameters and a 1 million token context window. Within hours, the consensus across X, Reddit, and HackerNews was unanimous: DeepSeek V4 had shadow-dropped.

Eight days later, Xiaomi’s AI division MiMo confirmed Hunter Alpha was actually MiMo-V2-Pro — an early test build of their upcoming agentic model. Not DeepSeek. Xiaomi. The phone company.

This story tells you three things about where Chinese AI is headed in 2026.

What Actually Happened

March 11: Hunter Alpha appears on OpenRouter. 1T parameters, 1M context, MoE architecture. No provider identified. It climbs to the top of OpenRouter’s leaderboard.

March 11-18: Everyone assumes DeepSeek V4. Financial Times had reported a March release window for V4. The specs aligned with leaked V4 rumors (1T params, new “Engram” architecture). Over 1 trillion tokens of usage accumulated in days.

March 18: Xiaomi’s MiMo division confirms ownership. The model is MiMo-V2-Pro: 1T total parameters, 42B active during inference, designed for autonomous AI agents.

March 19: Xiaomi formally announces the full MiMo-V2 family — Pro (agentic), Omni (multimodal), and a TTS model.

Why It Matters: Three Takeaways

1. The Chinese AI field has more players than you think

Most English-language coverage tracks 5-6 Chinese AI companies: DeepSeek, Alibaba/Qwen, Moonshot/Kimi, Zhipu/GLM, Baidu, and maybe ByteDance. Xiaomi isn’t on anyone’s radar for foundation models.

But MiMo’s team is led by Luo Fuli, a former senior researcher at DeepSeek who helped build their breakthrough models. The talent pipeline in China’s AI ecosystem flows in unexpected directions — from pure-play AI labs into hardware companies, telcos, and automakers.

Xiaomi has the compute (they run massive cloud infrastructure for their IoT ecosystem), the deployment surface (phones, smart home, EVs), and now apparently the models. A trillion-parameter agent model designed for Xiaomi’s hardware ecosystem is a different competitive threat than another chatbot.

2. DeepSeek V4 is delayed — and that’s unusual

The reason everyone assumed Hunter Alpha was V4: DeepSeek V4 was supposed to ship in March. The Financial Times reported it. Chinese tech outlets confirmed it. Multiple release windows (mid-February, late February, early March) came and went.

As of March 30, V4 still hasn’t launched. Chinese tech outlet Whale Lab now reports an April 2026 timeline, alongside Tencent’s new Hunyuan model.

A “V4 Lite” appeared briefly on DeepSeek’s website on March 9 — rumored at ~200B parameters, possibly a preview of V4’s architecture. But it’s not in the public API docs.

For developers: don’t wait for V4. DeepSeek V3.2 at $0.28/M input is production-ready now. When V4 lands, it’ll be an upgrade, not a paradigm shift.

3. The OpenRouter effect is real

Hunter Alpha racked up 1 trillion+ tokens of usage in 8 days — as an anonymous model with no marketing, no brand, no documentation. Just raw performance on a leaderboard.

This tells you something about how developers actually choose models in 2026: they don’t read press releases. They look at OpenRouter rankings, try the model, and switch if it’s better. Brand matters less than benchmarks.

For Chinese model providers, this is good news. The adoption barrier isn’t brand recognition — it’s accessibility. The models that show up where developers already are (OpenRouter, Cursor, Cline) get used. The ones locked behind Volcano Engine or Baidu Cloud don’t.

What This Means for Your Model Selection

The MiMo-V2-Pro reveal doesn’t change your immediate model choices — it’s not publicly available as an API yet. But it signals that:

  • More competition = lower prices. When Xiaomi, Tencent, and ByteDance are all shipping frontier models alongside DeepSeek and Qwen, pricing pressure intensifies. See current prices.
  • Agentic models are the next battleground. MiMo-V2-Pro, Kimi K2.5’s Agent Swarm, GLM-5’s agent optimization — every major Chinese provider is building for agents, not just chat.
  • Watch for MiMo on OpenRouter. If Xiaomi makes MiMo-V2-Pro publicly available (likely through OpenRouter first), it could be the dark horse of 2026. A 1T model with 42B active params is the same MoE efficiency play as DeepSeek V3, just bigger.

For now, the practical recommendation hasn’t changed. Use this framework to pick the right model for your use case. But keep MiMo on your radar.


Developing story. We’ll update when DeepSeek V4 or MiMo-V2-Pro becomes publicly available as an API.

More from the Blog