Pricing

Pay as you go, from your balance

Top up your balance, then pay only for what you use: per token for text, per megapixel for images, and per minute or per audio token for transcription — no subscription, no hidden fees.

Starter

$2.78

$2.78 balance

To get started

  • Access to all active models
  • Web chat + API
  • No subscription
Sign up & top up

Pro

Popular

$8.33

$8.83 balance

+6% bonus — regular use

  • Everything in Starter
  • +6% balance bonus
  • Great for Claude Code
Sign up & top up

Business

$27.78

$31.94 balance

+15% bonus — high volume

  • Everything in Pro
  • +15% balance bonus
  • For teams / production
Sign up & top up

Or top up a custom amount from a low minimum. Usage is billed per token at each model's rate.

Top up any amount from a low minimum. Balance bonuses: +6% on larger top-ups, +15% on the highest tier.

Worked examples

Costs are computed straight from each model's rate. Here are estimates for a few scenarios.

ModelTokens (in / out)Estimated cost
DeepSeek V4 Flash
DeepSeek · Long context, short reply
9,000 / 1,000≈ $0.0005
GPT-5 Mini
OpenAI · One short message
1,000 / 1,000≈ $0.0007
Claude Sonnet 4.6
Anthropic · One short message
1,000 / 1,000≈ $0.0054

Estimates are rounded. Your actual cost follows the real token count of each request.

Rates by model

Grouped by modality: text (per token), image (per megapixel), and transcription (per minute or per audio token).

Text

ModelRate
DeepSeek V4 Flash
DeepSeek
Input: $0.042 / 1M tokens
Output: $0.084 / 1M tokens
DeepSeek V4 Pro
DeepSeek
Input: $0.131 / 1M tokens
Output: $0.261 / 1M tokens
GPT-4.1 MiniVision
OpenAI
Input: $0.12 / 1M tokens
Output: $0.48 / 1M tokens
GPT-5 MiniVision
OpenAI
Input: $0.075 / 1M tokens
Output: $0.6 / 1M tokens
Claude Opus 4.8Vision
Anthropic
Input: $1.5 / 1M tokens
Output: $7.5 / 1M tokens
Claude Sonnet 4.6Vision
Anthropic
Input: $0.9 / 1M tokens
Output: $4.5 / 1M tokens

Image

ModelRate
FLUX.2 pro
azure-openai
Per megapixel: $0.009 / MP

Transcribe

ModelRate
GPT-4o Transcribe
OpenAI
Audio in: $1.8 / 1M tokens
Text out: $3 / 1M tokens

How billing works

Cost = token count × the model's rate. Input tokens come from the messages you send, output tokens from the length of the reply. Charged against your balance.

Cache tokens (Claude)

For Claude models, cache tokens are billed separately: cache reads are cheaper and cache writes are slightly more than the standard input rate. Other models don't break out cache tokens.