Fireworks

Fast, cost-efficient hosting for open-weight models.

Fireworks specialises in optimised, low-cost hosting of open-weight models — Llama, GPT-OSS, DeepSeek, Qwen, Mistral, and dozens of fine-tuned variants — across serverless and dedicated tiers. Globally hosted by default — serverless is not region-pinned. Competitive on price-per-token for high-throughput open-weight workloads where cost matters more than feature parity with closed frontier models.

1 route18 modelsMulti

fireworks.ai

Models on Fireworks

Every model we route through Fireworks. Compare residency, ZDR, training posture, and price at a glance — full data-handling detail per route below.

Model	Region	Zero data retention	Training	Context	Input	Output
Kimi K3 Moonshot	Multi	Enterprise	No	1M	$3.00	$15.00
GLM-5.2 Z.ai	Multi	Enterprise	No	1M	$1.40	$4.40
MiniMax M3 MiniMax	Multi	Enterprise	No	512K	$0.30	$1.20
DeepSeek V4 Pro DeepSeek	Multi	Enterprise	No	1M	$1.74	$3.48
Kimi K2.6 Moonshot	Multi	Enterprise	No	262K	$0.95	$4.00
Kimi K2.7 Code Moonshot	Multi	Enterprise	No	262K	$0.95	$4.00
DeepSeek V4 Flash DeepSeek	Multi	Enterprise	No	1M	$0.14	$0.28
GLM 5.1 Z.ai	Multi	Enterprise	No	200K	$1.40	$4.40
Qwen3.6-Plus Alibaba	Multi	Enterprise	No	1M	$0.50	$3.00
Qwen3.7-Plus Alibaba	Multi	Enterprise	No	262K	$0.40	$1.60
MiniMax M2.7 MiniMax	Multi	Enterprise	No	197K	$0.30	$1.20
Kimi K2.5 Moonshot	Multi	Enterprise	No	262K	$0.60	$3.00
MiniMax M2.5 MiniMax	Multi	Enterprise	No	197K	$0.30	$1.20
GPT OSS 120B OpenAI	Multi	Enterprise	No	131K	$0.15	$0.60
GPT OSS 20B OpenAI	Multi	Enterprise	No	131K	$0.07	$0.30
Kimi K2.7 Code Fast Moonshot	Multi	Enterprise	No	262K	$1.90	$8.00
GLM-5.2 Fast Z.ai	Multi	Enterprise	No	1M	$2.10	$6.60
NVIDIA Nemotron 3 Ultra NVFP4 NVIDIA	Multi	Enterprise	No	262K	$0.60	$2.40

Data handling per route

Fireworks hosts on 1 route. Each route has its own privacy posture, residency, and GDPR terms. Postures are maintained by Opper with a last-verification timestamp.

United States🇺🇸

Zero data retention is available via Opper Enterprise contract. No training on customer data. GLOBAL; SCCs; DPA available.

Zero data retention: Available via Opper Enterprise contract.
Training: No training on customer data.
Logging: None
Third-party access: None disclosed
GDPR DPA: DPA available
Transfer mechanism: SCCs