Pricing

Pay only for what you use. No subscriptions.

Anthropic, OpenAI, and Google models pass through at official prices with no markup. Chinese models carry a small infrastructure premium. Top-ups carry zero fee on every payment rail. Top up once, spend anywhere.

Get $5 free credit

Browse all models

0% top-up fee on every payment rail

0% markup on Anthropic / OpenAI / Google

¥35 (~$5) free credit on signup

¥0.70 (~$0.10) minimum top-up

Per-model pricing

All prices in USD per 1M tokens unless otherwise noted. Updated weekly from upstream rate cards.

Chat

Model	Region	Vendor	Context	Input (USD/1M tokens)	Output (USD/1M tokens)	Markup	Notes
DeepSeek V3.2	CN	DeepSeek	64K	$0.14	$0.28	0% pass-through	GPT-4-class general model. The most cost-effective model in the world for code & chat.
Kimi K2.5	CN	Moonshot	128K	$0.60	$2.50	0% pass-through	Long-context tool-use champion. Strong agentic behaviour, free 128K context.
Qwen3 Max	CN	Alibaba	256K	$0.60	$2.40	0% pass-through	256K context. Strong on multilingual and bilingual reasoning.
MiniMax M2.5	CN	MiniMax	200K	$0.30	$1.20	0% pass-through	Fastest Chinese chat model: 130 tokens/sec on cold start.
Claude Sonnet 4.6	Global	Anthropic	200K	$3.00	$15.00	0% pass-through	Workhorse Anthropic model. 5× cheaper than Opus, ~90% the quality.
GPT-5.5	Global	OpenAI	256K	$5.00	$20.00	0% pass-through	Latest GPT generation. Pass-through pricing.
Gemini 3 Pro	Global	Google	2M	$1.25	$5.00	0% pass-through	2M token context. Pass-through pricing.
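
As a rough illustration of how per-1M-token prices turn into a per-request cost, the sketch below multiplies token counts by the rates in the Chat table. The function name `estimate_cost_usd` and the token counts are made up for this example; the two prices are copied from the table, and the real bill may differ if rates change.

```python
from decimal import Decimal

# Prices copied from the Chat table, in USD per 1M tokens: (input, output).
PRICES_PER_1M = {
    "DeepSeek V3.2": (Decimal("0.14"), Decimal("0.28")),
    "Claude Sonnet 4.6": (Decimal("3.00"), Decimal("15.00")),
}


def estimate_cost_usd(model, input_tokens, output_tokens):
    """Estimate the USD cost of one request from its token counts."""
    input_price, output_price = PRICES_PER_1M[model]
    total = input_tokens * input_price + output_tokens * output_price
    return total / Decimal(1_000_000)


# Hypothetical request: 12,000 input tokens and 3,000 output tokens.
for model in PRICES_PER_1M:
    print(model, estimate_cost_usd(model, 12_000, 3_000))
```

With these example token counts, the DeepSeek V3.2 request comes to $0.00252 and the Claude Sonnet 4.6 request to $0.081.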

Reasoning

Model	Region	Vendor	Context	Input (USD/1M tokens)	Output (USD/1M tokens)	Markup	Notes
DeepSeek R1	CN	DeepSeek	64K	$0.55	$2.19	0% pass-through	Open reasoning model rivalling o1 on math & logic, at 1/30 the price.
Claude Opus 4.7	Global	Anthropic	200K	$15.00	$75.00	0% pass-through	Frontier reasoning. Pass-through pricing at exactly the official rate.
o1	Global	OpenAI	200K	$15.00	$60.00	0% pass-through	OpenAI flagship reasoning. Pass-through pricing.

Code

Model	Region	Vendor	Context	Input (USD/1M tokens)	Output (USD/1M tokens)	Markup	Notes
GLM-4.6	CN	Zhipu	128K	$0.50	$1.50	0% pass-through	Coder-tuned. Strong on Chinese function calling and tool routing.
Qwen3 Coder	CN	Alibaba	128K	$0.30	$1.50	0% pass-through	Coder-specialised Qwen variant. SOTA on HumanEval-CN.

Vision

Model	Region	Vendor	Context	Input (USD/1M tokens)	Output (USD/1M tokens)	Markup	Notes
Doubao 1.6 Pro	CN	ByteDance	64K	$0.40	$1.20	0% pass-through	Strong on image understanding & Chinese OCR. Mature production model.
GPT-4o	Global	OpenAI	128K	$2.50	$10.00	0% pass-through	Multimodal frontier. Pass-through pricing.

Video

Model	Region	Vendor	Context	Price	Markup	Notes
Kling 1.6	CN	Kling	n/a	$0.50 per second of video	0% pass-through	Best Chinese text-to-video. 5s/10s clips, 720p/1080p output.

Payment rails — every one at zero fee.

Most aggregators charge 5–6% on top-ups. We don't. We make money on volume spread on Chinese models, not on your wallet.

USDT (TRC-20 / ERC-20)
0% top-up fee
OpenRouter: 5%

Stripe (Card)
0% top-up fee
OpenRouter: 5.5% on credit card
We absorb the 2.9% gateway fee

WeChat Pay / Alipay (China rails)
0% top-up fee
Not supported by OpenRouter

Bank wire + VAT invoice (China B2B)
0% top-up fee
Enterprise only on most platforms
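
To make the fee difference concrete, here is a small sketch using only the percentages quoted on this page. It assumes the fee is added on top of the credit you buy, which this page does not specify; `total_charged` and the labels in `FEE_RATES` are illustrative names, not part of any API.

```python
from decimal import Decimal

# Top-up fee rates as quoted on this page (fractions of the top-up amount).
FEE_RATES = {
    "Routify (any rail)": Decimal("0"),
    "OpenRouter (USDT)": Decimal("0.05"),
    "OpenRouter (credit card)": Decimal("0.055"),
}


def total_charged(credit_usd, fee_rate):
    """Amount paid for credit_usd of credit, ASSUMING the fee is added on top."""
    return (Decimal(credit_usd) * (1 + fee_rate)).quantize(Decimal("0.01"))


# Compare what a $100 top-up would cost under each quoted rate.
for label, rate in FEE_RATES.items():
    print(label, total_charged(100, rate))
```

Under that assumption, $100 of credit costs $100.00 at a 0% fee, $105.00 at 5%, and $105.50 at 5.5%.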

China B2B: bank wire + auto-issued 6% VAT e-invoice

Invoice arrives within 7 business days. Save your company name, tax ID, and bank details once.

Top up and request invoice

Pricing FAQ

Why is there a markup on Chinese models?

Upstream Chinese providers charge in CNY, demand prepaid corporate accounts, and rate-limit aggressively. We carry that infrastructure cost (Singapore + Shanghai egress, retry logic, multi-channel failover) and pass it on to you at one transparent rate. The markup ranges from 0% to 80% by model group; check each model in the tables above.

Why do you not mark up GPT / Claude / Gemini?

These vendors take direct USD payment and have global infra. We have nothing to add — we just route. The only reason to use Routify for them is to consolidate billing and keep one wallet across all models.

Do you keep my data?

No. By default, prompts and completions are not logged. Aggregate metadata (token counts, latency, model ID) is retained for 90 days for billing. You can opt in to per-request logging for debugging from the dashboard.

What about volume discounts?

If you spend more than $1k/month, you qualify for the VIP rate (a 5% discount across all models). Above $10k/month, contact us for enterprise terms with dedicated channels and an SLA.
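
The VIP tier can be sketched as a simple threshold check. This assumes the 5% discount applies to the whole month's spend once it exceeds $1k, which the answer above does not spell out; `discounted_spend` is a hypothetical helper, and enterprise terms above $10k/month are negotiated, so they are not modelled here.

```python
from decimal import Decimal

VIP_THRESHOLD_USD = Decimal("1000")  # "$1k/month" from the FAQ answer
VIP_DISCOUNT = Decimal("0.05")       # 5% across all models


def discounted_spend(monthly_spend_usd):
    """Monthly bill after the VIP discount.

    ASSUMPTION: the discount covers the entire month's spend once the
    threshold is exceeded, rather than only the portion above it.
    """
    spend = Decimal(monthly_spend_usd)
    if spend > VIP_THRESHOLD_USD:
        spend = spend * (1 - VIP_DISCOUNT)
    return spend.quantize(Decimal("0.01"))


print(discounted_spend(800))   # below the threshold, no discount
print(discounted_spend(2000))  # above the threshold, 5% off
```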

Can I bring my own API key (BYOK)?

Yes, starting in Q3 2026. You will be able to attach your own Anthropic / OpenAI / Google key and pay only the routing fee (0% on the first 1M requests/month).

How is billing computed?

Billing is computed per request and per token. Each response carries `X-Routify-Cost-USD` and `X-Routify-Cost-CNY` headers so your code can audit the cost immediately. The dashboard shows every request as a line item, exportable as CSV or JSON.
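
A minimal sketch of auditing those headers in client code follows. It assumes the header values are plain decimal strings, which this page does not confirm; `read_cost_headers` is a hypothetical helper and the sample values are invented for illustration. It works on any mapping of header names to values, so it can be fed the headers object from whichever HTTP client you use.

```python
from decimal import Decimal


def read_cost_headers(headers):
    """Return (usd, cny) costs from response headers, or None where absent.

    Header names are matched case-insensitively. ASSUMPTION: values are
    plain decimal strings such as "0.00252".
    """
    lowered = {name.lower(): value for name, value in headers.items()}
    usd = lowered.get("x-routify-cost-usd")
    cny = lowered.get("x-routify-cost-cny")
    return (
        Decimal(usd) if usd is not None else None,
        Decimal(cny) if cny is not None else None,
    )


# Invented sample headers, for illustration only.
sample = {"X-Routify-Cost-USD": "0.00252", "x-routify-cost-cny": "0.01764"}
usd, cny = read_cost_headers(sample)
print(usd, cny)
```

Parsing into `Decimal` rather than `float` keeps running totals exact when you sum many small per-request costs.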