Pricing

Pay as you go, from your balance

Top up your balance, then pay only for what you use: per token for text, per megapixel for images, and per minute or per audio token for transcription — no subscription, no hidden fees.

Starter

$2.78

$2.78 balance

To get started

Access to all active models
Web chat + API
No subscription

Pro

Popular

$8.33

$8.83 balance

+6% bonus — regular use

Everything in Starter
+6% balance bonus
Great for Claude Code

Business

$27.78

$31.94 balance

+15% bonus — high volume

Everything in Pro
+15% balance bonus
For teams / production

Or top up a custom amount from a low minimum. Usage is billed per token at each model's rate.

Top up any amount from a low minimum. Balance bonuses: +6% on larger top-ups, +15% on the highest tier.

Worked examples

Costs are computed straight from each model's rate. Here are estimates for a few scenarios.

Model	Tokens (in / out)	Estimated cost
DeepSeek V4 Flash DeepSeek · Long context, short reply	9,000 / 1,000	≈ $0.0005
GPT-5 Mini OpenAI · One short message	1,000 / 1,000	≈ $0.0007
Claude Sonnet 4.6 Anthropic · One short message	1,000 / 1,000	≈ $0.0054

Estimates are rounded. Your actual cost follows the real token count of each request.

Rates by model

Grouped by modality: text (per token), image (per megapixel), and transcription (per minute or per audio token).

Text

Model	Rate
DeepSeek V4 Flash DeepSeek	Input: $0.042 / 1M tokens Output: $0.084 / 1M tokens
DeepSeek V4 Pro DeepSeek	Input: $0.131 / 1M tokens Output: $0.261 / 1M tokens
GPT-4.1 MiniVision OpenAI	Input: $0.12 / 1M tokens Output: $0.48 / 1M tokens
GPT-5 MiniVision OpenAI	Input: $0.075 / 1M tokens Output: $0.6 / 1M tokens
Claude Opus 4.8Vision Anthropic	Input: $1.5 / 1M tokens Output: $7.5 / 1M tokens
Claude Sonnet 4.6Vision Anthropic	Input: $0.9 / 1M tokens Output: $4.5 / 1M tokens

Image

Model	Rate
FLUX.2 pro azure-openai	Per megapixel: $0.009 / MP

Transcribe

Model	Rate
GPT-4o Transcribe OpenAI	Audio in: $1.8 / 1M tokens Text out: $3 / 1M tokens

How billing works

Cost = token count × the model's rate. Input tokens come from the messages you send, output tokens from the length of the reply. Charged against your balance.

Cache tokens (Claude)

For Claude models, cache tokens are billed separately: cache reads are cheaper and cache writes are slightly more than the standard input rate. Other models don't break out cache tokens.