GPT OSS 20B

by OpenAI

GPT-OSS 20B is an open-weight Mixture-of-Experts language model released by OpenAI in August 2025 under the Apache 2.0 license, alongside the larger 120B variant. It has 20.9B total parameters with 3.6B active per token across 24 layers, and can run on devices with around 16GB of memory. The model matches or exceeds OpenAI o3-mini on common benchmarks while staying highly efficient. It supports reasoning, tool use with function calling, and structured outputs, fitting cost-conscious agentic workflows and local inference. Its MoE design selects the top 4 of 32 experts per token, with a 128k context window, giving developers a compact open-weight option that keeps strong reasoning capability.

Key info

Input
Output
Features
Context window
128K
Max output
8K
Input price
$0.07 /1M
Output price
$0.30 /1M
  • EU/EEA residency available
  • US residency available
  • Zero data retention on pay-as-you-go
  • No training by default
  • GDPR DPA available

Available routes

GPT OSS 20B runs on 6 different routes through the Opper gateway. Compare residency, ZDR, and training posture at a glance β€” full data-handling detail per route below.

ProviderRegionZero data retentionTrainingInputOutput
AWS BedrockEUZero data retentionNo$0.07$0.30
FireworksUSEnterpriseNo$0.07$0.30
GeoddEU/EEAZero data retentionNo$0.03$0.14
GeoddUSZero data retentionNo$0.03$0.14
GroqUSEnterpriseNo$0.07$0.30
NovitaUSEnterpriseNo$0.04$0.15

Training posture across routes: No training on prompts by default.

Data handling per route

Each route hosting GPT OSS 20B has its own privacy posture, residency, and GDPR terms. Postures are maintained by Opper with a last-verification timestamp.

AWS Bedrock β€” SwedenπŸ‡ΈπŸ‡ͺ

Zero data retention is on by default on Pay-as-you-go β€” no action required. No training on customer data. EU; DPA available.

Zero data retention
On by default on Pay-as-you-go.
Training
No training on customer data.
Logging
None
Third-party access
None disclosed
GDPR DPA
DPA available
Transfer mechanism
Not applicable β€” data stays in EU

Fireworks β€” United StatesπŸ‡ΊπŸ‡Έ

Zero data retention is available via Opper Enterprise contract. No training on customer data. US; SCCs; DPA available.

Zero data retention
Available via Opper Enterprise contract.
Training
No training on customer data.
Logging
None
Third-party access
None disclosed
GDPR DPA
DPA available
Transfer mechanism
SCCs

Geodd β€” NorwayπŸ‡³πŸ‡΄

Zero data retention is on by default on Pay-as-you-go β€” no action required. No training on customer data. EEA; SCCs.

Zero data retention
On by default on Pay-as-you-go.
Training
No training on customer data.
Logging
None
Third-party access
Provider may share with subprocessors / partners
GDPR DPA
No DPA
Transfer mechanism
SCCs

Geodd β€” United StatesπŸ‡ΊπŸ‡Έ

Zero data retention is on by default on Pay-as-you-go β€” no action required. No training on customer data. US; SCCs.

Zero data retention
On by default on Pay-as-you-go.
Training
No training on customer data.
Logging
None
Third-party access
Provider may share with subprocessors / partners
GDPR DPA
No DPA
Transfer mechanism
SCCs

Groq β€” United StatesπŸ‡ΊπŸ‡Έ

Zero data retention is available via Opper Enterprise contract. No training on customer data. US; SCCs; DPA available.

Zero data retention
Available via Opper Enterprise contract.
Training
No training on customer data.
Logging
Abuse monitoring (30-day retention)
Third-party access
Provider may share with subprocessors / partners
GDPR DPA
DPA available
Transfer mechanism
SCCs

Novita β€” United StatesπŸ‡ΊπŸ‡Έ

Zero data retention is available via Opper Enterprise contract. No training on customer data. US; SCCs; DPA available.

Zero data retention
Available via Opper Enterprise contract.
Training
No training on customer data.
Logging
Abuse monitoring
Third-party access
None disclosed
GDPR DPA
DPA available
Transfer mechanism
SCCs

Benchmarks

Independent benchmark scores β€” composite indices for reasoning, coding, and math, plus individual eval scores where available.

Global rank#250 of 531 LLMs
TierEfficient
Output speed227 tok/s
First token0.44s
Intelligence Index14.9
Coding Index18.5
Math Index89.3
Reasoning & knowledge
MMLU-Pro
75%
GPQA Diamond
69%
Humanity's Last Exam
10%
Long-context reasoning
31%
Coding
LiveCodeBench
78%
SciCode
34%
Agentic & tool use
Terminal-Bench Hard
11%
τ²-Bench Telecom
60%
Math & instruction following
AIME 2025
89%
IFBench
65%

Get started

Call GPT OSS 20B through the Opper gateway with one API key. Let your coding agent set it up, or call it directly β€” Opper is drop-in compatible with the OpenAI, Anthropic, and Google AI SDKs.

Set it up with your agent

Copy this and paste it into your coding agent β€” Claude Code, Cursor, Codex, and more β€” and it'll wire up Opper for you.

Or call it directly

import OpenAI from "openai";
const client = new OpenAI({
apiKey: process.env.OPPER_API_KEY,
baseURL: "https://api.opper.ai/v3/compat",
});
const completion = await client.chat.completions.create({
model: "aws/gpt-oss-20b-eu",
messages: [{ role: "user", content: "Hello" }],
});
console.log(completion.choices[0].message.content);

Compare GPT OSS 20B with…

Side-by-side on privacy, EU hosting, pricing, and benchmarks.

Other models from OpenAI

Start building with 300+ models

One API key. Every major provider. Up and running in minutes.

Get startedView Documentation
GPT OSS 20B by OpenAI β€” pricing, benchmarks, ZDR, EU/US hosting | Opper AI