Fireworks

Fast, cost-efficient hosting for open-weight models.

Fireworks specialises in optimised, low-cost hosting of open-weight models (Llama, GPT-OSS, DeepSeek, Qwen, Mistral, and dozens of fine-tuned variants) across serverless and dedicated tiers. All hosting is in the US. Its price per token is competitive for high-throughput open-weight workloads where cost matters more than feature parity with closed frontier models.

1 route · 9 models · US
fireworks.ai

Models on Fireworks

Every model we route through Fireworks. Compare residency, ZDR, training posture, and price at a glance — full data-handling detail per route below.

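Model | Region | Zero data retention | Training | Context | Input | Output
 | US | Enterprise | No | 131K | $0.15 | $0.60
 | US | Enterprise | No | 131K | $0.07 | $0.30
 | US | Enterprise | No | 1M | $1.74 | $3.48
 | US | Enterprise | No | | $0.50 | $3.00
 | US | Enterprise | No | 262K | $0.95 | $4.00
 | US | Enterprise | No | 262K | $0.60 | $3.00
 | US | Enterprise | No | 200K | $1.40 | $4.40
 | US | Enterprise | No | 197K | $0.30 | $1.20
 | US | Enterprise | No | 197K | $0.30 | $1.20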
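
To compare rows on cost, the Input and Output prices can be turned into a rough estimate for a given workload. The sketch below assumes the prices are in USD per 1 million tokens, which this page does not state explicitly; the function name `estimate_cost` and the token counts are illustrative only.

```python
# Assumption: table prices are USD per 1,000,000 tokens (the unit is not stated on the page).
TOKENS_PER_PRICE_UNIT = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price: float, output_price: float) -> float:
    """Return the estimated USD cost of a workload at the given per-unit prices."""
    total = input_tokens * input_price + output_tokens * output_price
    return total / TOKENS_PER_PRICE_UNIT


# Hypothetical workload priced at the first table row ($0.15 input, $0.60 output).
cost = estimate_cost(2_000_000, 500_000, 0.15, 0.60)
print(f"${cost:.2f}")  # prints "$0.60"
```

Because output tokens cost several times more than input tokens in every row, output-heavy workloads are the ones most sensitive to the choice of model.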

Data handling per route

Fireworks hosts on 1 route. Each route has its own privacy posture, residency, and GDPR terms. Postures are maintained by Opper with a last-verification timestamp.

United States 🇺🇸

Zero data retention is available via Opper Enterprise contract. No training on customer data. Data is hosted in the US, transfers rely on Standard Contractual Clauses (SCCs), and a DPA is available.

Zero data retention: Available via Opper Enterprise contract.
Training: No training on customer data.
Logging: None
Third-party access: None disclosed
GDPR DPA: DPA available
Transfer mechanism: SCCs
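
When screening routes against internal policy, the posture fields above can be held in a small record and checked programmatically. This is only a sketch: the `RoutePosture` class, its field names, and the `unmet_requirements` helper are hypothetical and are not part of any Opper or Fireworks API. The values are copied from the listing above.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RoutePosture:
    """Hypothetical record mirroring the data-handling fields listed for a route."""
    region: str
    zero_data_retention: str
    trains_on_customer_data: bool
    logging: str
    third_party_access: str
    dpa_available: bool
    transfer_mechanism: str


# Values taken from the United States route described above.
FIREWORKS_US = RoutePosture(
    region="US",
    zero_data_retention="Available via Opper Enterprise contract",
    trains_on_customer_data=False,
    logging="None",
    third_party_access="None disclosed",
    dpa_available=True,
    transfer_mechanism="SCCs",
)


def unmet_requirements(route: RoutePosture, allowed_regions: set[str],
                       require_dpa: bool = True) -> list[str]:
    """Return the policy requirements the route fails; an empty list means it passes."""
    problems = []
    if route.region not in allowed_regions:
        problems.append(f"region {route.region} is not allowed")
    if route.trains_on_customer_data:
        problems.append("provider trains on customer data")
    if require_dpa and not route.dpa_available:
        problems.append("no DPA available")
    return problems


print(unmet_requirements(FIREWORKS_US, allowed_regions={"US", "EU"}))  # prints []
```

A policy that only permits EU residency would reject this route on the region check alone.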

Start building with 300+ models

One API key. Every major provider. Up and running in minutes.
