Pricing

Pay-as-you-go pricing

Plans for developers, teams scaling AI, and enterprises.

Get started
Gateway
Platform fees
3%
Models300+ models across 25+ providers, all modalitiesSee all models & pricing →
300+ models
Routing and fallbacksManually specify fallback models per request
Prompt cachingSave money on repeated prompts
Provider native
TracingSpans, prompts, responses, custom metricsHow tracing works →
Metadata only
Bring your own key (BYOK)Use your own provider credits
Free
Rate limits
High
Spend controlsPrepaid credit balance with optional auto top-up
Credit balance
Payment
Stripe
Support
Control PlaneCharged at 5.5% on calls that use these features.
ObserveLLM-as-judge that auto-scores every generation. Judge model cost billed per use, on top of the platform fee.How Observe works →
RouteDefault model selection per org / project / function. Change models via rule edits — zero-downtime model switching, with optional fallback list.How Route works →
SteerUses Observe scores to auto-select few-shot examples and optimize prompts per function.How Steer works →
GuardPII masking, content filtering, input/output validation, custom policy rulesHow Guard works →
ComplyModel allowlists by provider/region/country, monthly budget caps, retention controls, ZDR-only mode.How Comply works →
Enterprise
Data handling
Metadata only
SLA guarantees
Audit loggingUser action logs for compliance
Custom hosting regions
SSO / SAML

Frequently asked questions

Pricing & billing

How does Gateway pricing work?

+

You pay the model providers' rates plus our 3% platform fee. No hidden charges, no per-seat fees.

How does Control Plane pricing work?

+

Control Plane features (Observe, Route, Steer, Guard, Comply, tracing) are opt-in. Calls that use any Control Plane feature are charged at 5.5% instead of 3%. Observe is billed additionally per use, since it runs its own judge model on top of your generation.

Do you mark up provider rates?

+

No. Token rates are passed through at provider cost. Our only fee is the platform fee (3% on Gateway calls, 5.5% if the call uses a Control Plane feature).

Are failed or fallback requests billed?

+

No. You are only billed for the successful model response. If a request fails and is retried on a fallback model, only the successful attempt is charged.

Are taxes (VAT) included?

+

Prices and fees are exclusive of VAT. VAT is added at checkout based on your billing country.

How do I pay?

+

You buy credits that work across all models and features. Top up manually or set up automatic top-ups to keep your balance funded. All payments are processed securely through Stripe.

Is there a minimum spend?

+

No minimums and no lock-in. Buy credits and use them at your own pace.

Do you offer volume discounts?

+

Yes. Enterprise plans include volume pricing, annual commitments, and custom invoicing. Contact us to discuss.

Contact sales

Models & migration

Which models are available?

+

300+ models from 25+ providers — Anthropic, OpenAI, Google, Meta, Mistral, DeepSeek, Alibaba, and more. Browse the full directory for benchmarks, pricing, and data residency.

Browse the model directory

Do you support streaming, tool calls, and prompt caching?

+

Yes. The gateway is OpenAI SDK compatible — streaming, tool/function calling, and provider-native prompt caching all pass through unchanged. Control Plane adds a custom caching layer on top.

How do I migrate from OpenAI or Anthropic?

+

Opper is fully OpenAI SDK compatible. Change your base URL and API key, and your existing code works as-is.

Routing & reliability

What happens if a provider is down?

+

You can pass a fallback model list per request. If the primary fails, the request is retried against the next model in the list, and you're only billed for the successful response.

Can I pin a specific model?

+

Yes. Pass the explicit model name in your request. On Control Plane, the Route feature lets you set defaults per org, project, or function and change them via rule edits — no code redeploy needed.

Data & privacy

What data does Opper store?

+

On Gateway, only analytics and error metadata — no prompts or responses. On Control Plane, traces (including prompts and responses) are stored when you enable tracing or Observe, with retention configurable per project. Enterprise contracts can include zero data retention (ZDR), and ZDR is also available on Control Plane on request.

Is my data used for training?

+

Opper never trains on your data. Per-provider data policies are listed in the models directory.

See model & provider data policies

Where is Opper hosted?

+

Opper is hosted in AWS Stockholm (EU) with enterprise-grade security, ensuring compliance with GDPR and EU data protection regulations. Custom hosting regions are available for Enterprise plans.

Start building with 300+ models

One API key. Every major provider. Up and running in minutes.

Get startedView Documentation