Phuriwaj

Minimax + LiteLLM Cost Estimation (Team)

Cost estimates for running Minimax models via a LiteLLM proxy for a development team of 20–30 people at medium daily usage.

Why / When to Use

Use these figures when planning LLM infrastructure budget for a team. Figures are based on Minimax’s yearly token plan pricing with the 25% yearly discount applied.

Core Concept / Estimates

25 developers, medium daily usage

ModelMonthly Cost (Yearly Plan)
Minimax-Text-01 (1M context)~$150–400
abab6.5t (cheap, smaller context)~$15–40

LiteLLM layer cost

TierMonthly Cost
OSS (self-hosted)Free
Enterprise (SSO, audit logs)~$50–100

Key Options / Variants

  • Yearly Medium tier — 25% discount, sweet spot for team size; doesn’t require a huge upfront commitment
  • abab6.5t — significantly cheaper; consider it if your use case doesn’t require Minimax-Text-01’s 1M context window
  • Per-seat vs. pooled — a single LiteLLM proxy with one Minimax API key serves all developers; billing is by tokens, not seats

Gotchas

  • Yearly token plans pre-purchase tokens at a discount. If your monthly burn rate is lower than the committed volume, you pay for unused tokens. Verify actual usage before committing.
  • Token prices on the Minimax platform page (platform.minimax.io/docs/guides/pricing-token-plan) are for yearly plans — monthly rates are higher.
  • Budget caps: set max_budget per user in LiteLLM config to prevent runaway spend from a single developer.

Updates — 2026-05-18

Ultra-Highspeed Token Plan (Individual / Small Team)

Evaluated for internal AI Workflow development and testing via Claude Code:

FieldValue
PackageUltra-Highspeed
ModelM2.7-highspeed
Throughput30,000 calls / 5 hours
Annual Price$1,500 USD (~48,449 THB at May 2026 BoT rate)

Use case: heavy AI Workflow testing where Claude Code token burn is the bottleneck. Not intended for pooled team use — see team estimates above for that.

→ See also minimax-security-privacy for enterprise data-handling posture.

Source

Conversation “Minimax and LiteLLM cost estimation for development team” — 2026-05-13 (BOL - Strategy project) Conversation “ขออนุมัติซื้อ MINIMAX Token Plan สำหรับพัฒนา AI” — 2026-05-18

Updates — 2026-05-22

Claude Code Team Premium vs MiniMax Ultra-Highspeed (Solo/Small Team)

Side-by-side comparison for a single developer or small team:

DimensionClaude Code Team PremiumMiniMax Ultra-Highspeed
Monthly price500/mo minimum (5 seats)150/mo (monthly)
Minimum spend$500/month (5 seats)$150/month (1 person)
ModelSonnet 4.6 / Opus 4.6M2.7-highspeed
Context window1,000,000 tokens204,800 tokens
Quota unitTokens (~275K/5hr, third-party estimate — not official)30,000 requests/5hr (official)
Overflow✅ Pay API rate when over❌ Not documented
Admin/SSO✅ Central billing, SSO❌ None
Data privacy✅ No-training by default⚠️ No explicit “won’t train” clause (see minimax-security-privacy)

Decision rule: If team < 5 people, MiniMax is significantly cheaper. If team ≥ 5 and context window or admin controls matter, Claude Code Team wins. Claude Code Team token quotas are not officially published — treat any third-party estimates as approximations only.

Source: Conversation “Minimax” — 2026-05-22