Pricing
Pay as you go, from your balance
Top up your balance, then pay only for what you use: per token for text, per megapixel for images, and per minute or per audio token for transcription — no subscription, no hidden fees.
Starter
$2.78
$2.78 balance
To get started
- Access to all active models
- Web chat + API
- No subscription
Pro
Popular$8.33
$8.83 balance
+6% bonus — regular use
- Everything in Starter
- +6% balance bonus
- Great for Claude Code
Business
$27.78
$31.94 balance
+15% bonus — high volume
- Everything in Pro
- +15% balance bonus
- For teams / production
Or top up a custom amount from a low minimum. Usage is billed per token at each model's rate.
Top up any amount from a low minimum. Balance bonuses: +6% on larger top-ups, +15% on the highest tier.
Worked examples
Costs are computed straight from each model's rate. Here are estimates for a few scenarios.
| Model | Tokens (in / out) | Estimated cost |
|---|---|---|
DeepSeek V4 Flash DeepSeek · Long context, short reply | 9,000 / 1,000 | ≈ $0.0005 |
GPT-5 Mini OpenAI · One short message | 1,000 / 1,000 | ≈ $0.0007 |
Claude Sonnet 4.6 Anthropic · One short message | 1,000 / 1,000 | ≈ $0.0054 |
Estimates are rounded. Your actual cost follows the real token count of each request.
Rates by model
Grouped by modality: text (per token), image (per megapixel), and transcription (per minute or per audio token).
Text
| Model | Rate |
|---|---|
DeepSeek V4 Flash DeepSeek | Input: $0.042 / 1M tokens Output: $0.084 / 1M tokens |
DeepSeek V4 Pro DeepSeek | Input: $0.131 / 1M tokens Output: $0.261 / 1M tokens |
GPT-4.1 MiniVision OpenAI | Input: $0.12 / 1M tokens Output: $0.48 / 1M tokens |
GPT-5 MiniVision OpenAI | Input: $0.075 / 1M tokens Output: $0.6 / 1M tokens |
Claude Opus 4.8Vision Anthropic | Input: $1.5 / 1M tokens Output: $7.5 / 1M tokens |
Claude Sonnet 4.6Vision Anthropic | Input: $0.9 / 1M tokens Output: $4.5 / 1M tokens |
Image
| Model | Rate |
|---|---|
FLUX.2 pro azure-openai | Per megapixel: $0.009 / MP |
Transcribe
| Model | Rate |
|---|---|
GPT-4o Transcribe OpenAI | Audio in: $1.8 / 1M tokens Text out: $3 / 1M tokens |
How billing works
Cost = token count × the model's rate. Input tokens come from the messages you send, output tokens from the length of the reply. Charged against your balance.
Cache tokens (Claude)
For Claude models, cache tokens are billed separately: cache reads are cheaper and cache writes are slightly more than the standard input rate. Other models don't break out cache tokens.