Minimax + LiteLLM Cost Estimation (Team)

Cost estimates for running Minimax models via a LiteLLM proxy for a development team of 20–30 people at medium daily usage.

Why / When to Use

Use these figures when planning LLM infrastructure budget for a team. Figures are based on Minimax’s yearly token plan pricing with the 25% yearly discount applied.

Core Concept / Estimates

25 developers, medium daily usage

ModelMonthly Cost (Yearly Plan)
Minimax-Text-01 (1M context)~$150–400
abab6.5t (cheap, smaller context)~$15–40

LiteLLM layer cost

TierMonthly Cost
OSS (self-hosted)Free
Enterprise (SSO, audit logs)~$50–100

Key Options / Variants

  • Yearly Medium tier — 25% discount, sweet spot for team size; doesn’t require a huge upfront commitment
  • abab6.5t — significantly cheaper; consider it if your use case doesn’t require Minimax-Text-01’s 1M context window
  • Per-seat vs. pooled — a single LiteLLM proxy with one Minimax API key serves all developers; billing is by tokens, not seats

Gotchas

  • Yearly token plans pre-purchase tokens at a discount. If your monthly burn rate is lower than the committed volume, you pay for unused tokens. Verify actual usage before committing.
  • Token prices on the Minimax platform page (platform.minimax.io/docs/guides/pricing-token-plan) are for yearly plans — monthly rates are higher.
  • Budget caps: set max_budget per user in LiteLLM config to prevent runaway spend from a single developer.

Updates — 2026-05-18

Ultra-Highspeed Token Plan (Individual / Small Team)

Evaluated for internal AI Workflow development and testing via Claude Code:

FieldValue
PackageUltra-Highspeed
ModelM2.7-highspeed
Throughput30,000 calls / 5 hours
Annual Price$1,500 USD (~48,449 THB at May 2026 BoT rate)

Use case: heavy AI Workflow testing where Claude Code token burn is the bottleneck. Not intended for pooled team use — see team estimates above for that.

→ See also minimax-security-privacy for enterprise data-handling posture.

Source

Conversation “Minimax and LiteLLM cost estimation for development team” — 2026-05-13 (BOL - Strategy project) Conversation “ขออนุมัติซื้อ MINIMAX Token Plan สำหรับพัฒนา AI” — 2026-05-18