PRICING
Simple, Transparent Pricing
One API key. 40+ models. Auto Router saves you up to 60%.
INDIVIDUAL
Free
$0/mo
5M tokens · Chinese models
  • 10+ Chinese models
  • 5 requests / min
  • Auto Router (1 tier)
  • Community support
Get Started Free
Starter
$19/mo
40M tokens · All Chinese
  • 20+ Chinese models
  • Unlimited requests
  • Auto Router (2 tiers)
  • Email support
  • Usage dashboard
Subscribe
Pro
$49/mo
110M tokens · Chinese + Intl pay-per-use
  • Everything in Starter
  • All 40+ models (Intl +25%)
  • Auto Router (3 tiers)
  • Priority support
  • Advanced analytics
Go Pro
Max
$99/mo
220M tokens · Chinese + Intl pay-per-use
  • Everything in Pro
  • All models (Intl +20%)
  • Auto Router (4 tiers)
  • Dedicated endpoint
  • Volume discounts
Go Max
BUSINESS
Team
$199/mo
400M tokens · All models
  • 5 seats · Intl +20%
  • Dedicated endpoint
  • SLA 99.5%
  • Invoice billing
Contact Sales
Business
$499/mo
1B tokens · All models
  • 20 seats · Intl +15%
  • Dedicated endpoint + VPC
  • SLA 99.9%
  • Priority invoice · Net 30
Contact Sales
Enterprise
Custom
Unlimited · Custom SLA
  • Unlimited seats · Intl +10%
  • On-premise / VPC option
  • SLA 99.95%
  • 24/7 dedicated support
Contact Sales
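To compare the fixed-price plans, it can help to work out what each one costs per 1M included tokens. The sketch below is only arithmetic on the prices and allowances listed above. It assumes the full monthly allowance is used; `PLANS` and `effective_rate` are illustrative names, not part of any AICraft API.

```python
# Plan name -> (monthly price in USD, included tokens in millions),
# copied from the plan list above. Free and Enterprise are omitted
# because they have no price or no fixed allowance.
PLANS = {
    "Starter": (19, 40),
    "Pro": (49, 110),
    "Max": (99, 220),
    "Team": (199, 400),
    "Business": (499, 1000),  # 1B tokens = 1,000M
}


def effective_rate(price_usd: float, tokens_millions: float) -> float:
    """USD per 1M included tokens, assuming the whole allowance is consumed."""
    return price_usd / tokens_millions


for name, (price, tokens) in PLANS.items():
    print(f"{name}: ${effective_rate(price, tokens):.3f} per 1M tokens")
```

These figures ignore seats, SLAs, dedicated endpoints, and international pay-per-use charges, so they are a rough guide rather than a like-for-like comparison.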
Auto Router — same task, up to 60% less cost

Every API request is routed automatically to the cheapest model that can handle the task. Translating a document? It goes to DeepSeek V4 Flash at $0.20 per 1M tokens instead of GPT-4o at $6.00 per 1M tokens. Complex code review? The request escalates to Claude or Qwen-Coder automatically. You get the right model at the right price, with no manual switching.
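The routing idea can be sketched as "filter the catalog by what the task needs, then take the cheapest match". This is a minimal illustration, not AICraft's actual routing logic, which this page does not describe. The prices come from the Model Catalog; the `skills` tags, the `Model` class, and the `route` function are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Model:
    name: str
    price_per_1m: float  # USD per 1M tokens, from the Model Catalog
    skills: frozenset    # hypothetical capability tags, assumed for this sketch


CATALOG = [
    Model("DeepSeek V4 Flash", 0.20, frozenset({"qa", "translation"})),
    Model("Kimi K2.7 Code", 1.60, frozenset({"code", "code_review"})),
    Model("Qwen-Coder", 3.20, frozenset({"code"})),
    Model("GPT-4o", 6.00, frozenset({"qa", "translation", "vision"})),
]


def route(task: str) -> Model:
    """Return the cheapest catalog model tagged as able to handle `task`."""
    candidates = [m for m in CATALOG if task in m.skills]
    if not candidates:
        raise ValueError(f"no model in the catalog handles task {task!r}")
    return min(candidates, key=lambda m: m.price_per_1m)


print(route("translation").name)
print(route("vision").name)
```

A production router would also need a way to judge task difficulty and to escalate when a cheaper model's answer falls short; that part is left out here.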

MODELS
Model Catalog
Pay-per-use pricing per 1M tokens. Subscription plans include Chinese models at lower effective rates via Auto Router.
Provider | Model | Context | Price / 1M tokens | Best For
DeepSeek | V4 Flash [CN] | 1M | $0.20 | Fast · Q&A · Translation
DeepSeek | V3.2 [CN] | 128K | $0.70 | Coding · Logic · Math
DeepSeek | R1 [CN] | 128K | $0.80 | Deep reasoning
DeepSeek | V4 Pro [CN] | 1M | $0.80 | Advanced reasoning
Alibaba | Qwen3.5 Turbo [CN] | 32K | $0.25 | Fast · Simple tasks
Alibaba | Qwen3.5 Plus [CN] | 32K | $1.60 | Balanced · General
Alibaba | Qwen3.7 Max [CN] | 32K | $3.20 | Chinese content · Complex
Alibaba | Qwen-Coder [CN] | 32K | $3.20 | Code generation
Zhipu | GLM-5 Flash [CN] | 128K | $1.00 | Fast · Cost-efficient
Zhipu | GLM-5.2 [CN] | 128K | $1.80 | Complex reasoning
Moonshot | Kimi K2.6 [CN] | 128K | $1.10 | Creative writing
Moonshot | Kimi K2.7 Code [CN] | 128K | $1.60 | Coding · Code review
ByteDance | Doubao Lite [CN] | 32K | $0.40 | Efficient · Low cost
MiniMax | M3 [CN] | 128K | $1.00 | Creative writing
Anthropic | Claude Opus 4.8 [INTL] | 200K | $9.50 | Most complex tasks
Anthropic | Claude Haiku 4.5 [INTL] | 200K | $2.20 | Fast · Summarization
Anthropic | Claude Sonnet 5 [NEW] | 200K | Coming soon | Complex · Code review
OpenAI | GPT-5.5 [INTL] | 128K | $12.50 | Most capable
OpenAI | GPT-4o [INTL] | 128K | $6.00 | Multimodal · Vision
OpenAI | GPT-4o-mini [INTL] | 128K | $1.60 | Fast · Cost-efficient
Google | Gemini 3 Pro [INTL] | 1M | $4.80 | Multimodal · Vision
Google | Gemini 3.5 Flash [INTL] | 1M | $3.30 | Fast · Summarization
Google | Gemini 2.5 Pro [INTL] | 1M | $1.70 | Balanced · Multimodal
Google | Gemini Flash Lite [INTL] | 1M | $0.12 | Lightweight · Edge
xAI | Grok 4.3 [INTL] | 128K | $2.00 | Creative · Brainstorm
xAI | Grok Build 0.1 [INTL] | 128K | $0.60 | Efficient · Code
Mistral | Large 3 [INTL] | 128K | $1.90 | Multilingual · EU data
Meta | Llama 4 Maverick [INTL] | 128K | $0.50 | Open source
Kuaishou | Kling 2.0 [CN] | - | $0.50 / video | Text-to-video
ByteDance | Seedance 2.0 [DIRECT] | - | $0.50 / video (pass-through) | 11s · Multi-reference
Alibaba | Tongyi Wan2.1 [CN] | - | $0.35 / video | VBench #1
Prices are blended retail rates; see the API docs for detailed input and output pricing. For enterprise volume discounts, contact sales. International models may carry additional latency.
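Since the catalog prices are quoted per 1M tokens, a pay-per-use estimate is just token count times the blended rate. The helper below is a rough sketch under that assumption. It ignores the separate input/output rates mentioned above and any plan-specific international surcharge; `estimate_cost` is an illustrative name, not an AICraft API call.

```python
def estimate_cost(tokens: int, price_per_1m: float) -> float:
    """Estimated pay-per-use cost in USD at a blended per-1M-token rate."""
    if tokens < 0:
        raise ValueError("tokens must be non-negative")
    return round(tokens / 1_000_000 * price_per_1m, 2)


# 5M tokens priced with two rates from the catalog.
flash_cost = estimate_cost(5_000_000, 0.20)  # DeepSeek V4 Flash
gpt4o_cost = estimate_cost(5_000_000, 6.00)  # GPT-4o

print(f"DeepSeek V4 Flash: ${flash_cost:.2f}")
print(f"GPT-4o: ${gpt4o_cost:.2f}")
```

For an actual bill, the per-direction input and output rates in the API docs would replace the single blended figure.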