MiniMax icon

MiniMax Review 2026: Frontier Coding at a Fraction of the Price

🇨🇳

Review of MiniMax M2.5, the model scoring 80.2% on SWE-Bench Verified at roughly 1/20th the cost of Claude Opus. Works inside Claude Code, Cline, Cursor, and more.

Share

What Is MiniMax?

MiniMax is a Shanghai-based AI company (now publicly traded on the Hong Kong Stock Exchange) that builds large language models optimized for coding and agentic workflows. Their flagship model, M2.5 (released February 12, 2026), scores 80.2% on SWE-Bench Verified: within 0.7 percentage points of Claude Opus 4.5, at roughly 1/10th to 1/20th the per-token cost.

MiniMax operates both a consumer-facing Agent app (web + mobile) and an API platform with Anthropic-compatible and OpenAI-compatible endpoints, meaning their models work inside tools you already use: Claude Code, Cline, Cursor, OpenCode, Devin Desktop, Codex CLI, and more.

Model Lineup

  • MiniMax-M2.5: Full-power variant, ~50 tokens/sec, highest benchmark scores
  • MiniMax-M2.5-Lightning: ~100 tokens/sec, optimized for latency-sensitive workloads
  • MiniMax-M2.1: Previous generation, open-weight (available on Hugging Face and via Ollama)

All models use a Mixture-of-Experts architecture: 230B total parameters with ~10B active per token, keeping inference fast and cheap despite the large parameter count. Context window is approximately 200K tokens.

What Sets It Apart

Price-performance ratio. This is the core story. M2.5 API pricing sits at ~$0.15/M input, ~$1.20/M output tokens. Claude Opus 4.6 costs $5/$25 per million. GPT-5.3-Codex is in a similar range. On SWE-Bench Verified, the gap between M2.5 (80.2%) and Opus 4.5 (80.9%) is 0.7 percentage points. The price gap is 20x-30x.

Tool compatibility. MiniMax provides both Anthropic-compatible (api.minimax.io/anthropic) and OpenAI-compatible (api.minimax.io/v1) endpoints. This means you can point Claude Code, Cline, Cursor, or virtually any tool that supports custom API endpoints at MiniMax models without any special integration work.

Token Plan subscriptions. For developers who don’t want to think about per-token costs, MiniMax sells flat-rate Token Plans (formerly called Coding Plans) starting at $20/mo. These use a rolling prompt pool that resets every 5 hours, designed for agentic coding bursts rather than sustained all-day usage.

Spec-writing behavior. M2.5 has a distinctive tendency to plan architecture before writing code. It drafts specifications, then implements against them. This improves quality on complex multi-file tasks compared to models that jump straight into code generation.

Speed. M2.5 completes SWE-Bench tasks 37% faster than M2.1, matching Claude Opus 4.6’s average completion time. The Lightning variant hits ~100 tokens/sec, roughly double most frontier models.

Pricing

Token Plan (Subscription)

PlanMonthlyDetails
Plus$20/moPersonal projects and prototyping; rolling prompt pool, 5-hour reset
Max$50/moDaily coding with agents and multimodal work
Ultra$120/moHeavy agent workflows and extended sessions

Token Plans include exclusive MCP tools (web search, image understanding) and work inside Claude Code, Cline, OpenCode, and other supported tools. (Prices verified June 2026; the lineup was previously branded “Coding Plan” with a $10 Starter tier.)

API (Pay-as-You-Go)

ModelInputOutput
M2.5~$0.15/M tokens~$1.20/M tokens
M2.5-Lightning$0.30/M tokens$2.40/M tokens

Agent App

PlanPrice
Free (Lightning)$0/mo
Basic$19/mo
Pro$69/mo

Who It’s For

  • Developers running high-volume agentic workloads where per-token costs add up fast, batch refactoring, automated code review, CI/CD-integrated agents
  • Claude Code / Cline users who hit cost ceilings: swap the API endpoint and get comparable coding quality at a fraction of the price
  • Solo developers and small teams who want frontier-level coding assistance without $100-200/mo subscriptions
  • Teams evaluating alternatives to DeepSeek: similar price point, strong coding benchmarks, better out-of-the-box integration with Western coding tools

Who It’s Not For

  • Enterprise teams needing SSO, SOC 2, or compliance features: not available yet
  • Developers who need the absolute best model for non-coding tasks: Claude and GPT still lead on general reasoning, creative writing, and some specialist domains
  • Users who want a large community and mature documentation: MiniMax’s ecosystem is growing fast but still smaller than the major providers

Sources

Key Features

AI chat
Agentic mode
Multi-file editing
Terminal commands
Test generation
Debugging
Git integration
Spec-writing (plans architecture before coding)
MCP tool support
Anthropic-compatible API
OpenAI-compatible API
Function calling
Interleaved thinking

Supported Models

MiniMax model family

MiniMax-M2.5 MiniMax-M2.5-Lightning MiniMax-M2.1 MiniMax-M2.1-Lightning

MiniMax Pricing

Most Popular

Plus

$20 /mo

Token Plan; personal projects and prototyping

  • Token Plan; personal projects and prototyping

Max

$50 /mo

Token Plan; daily coding with agents and multimodal work

  • Token Plan; daily coding with agents and multimodal work

Ultra

$120 /mo

Token Plan; heavy agent workflows and extended sessions

  • Token Plan; heavy agent workflows and extended sessions
Billing: Subscription + API Verified Jun 5, 2026. Official pricing →

Plans, features, and usage limits may change. Always check MiniMax's official pricing for the latest details.

Confirmed Features

Agentic Mode
Multi-File Editing
Terminal Commands
Git Integration

Platform Support

Platforms: Web, Desktop, API

IDEs: VS Code (via Cline, Kilo Code, Roo Code), Cursor, Devin Desktop (formerly Windsurf), JetBrains (via plugins), Claude Code (CLI), OpenCode (CLI), Codex CLI, Web

Share