GLM 5

by Z.ai

GLM-5 is Zhipu's fifth-generation frontier foundation model released February 2026, with 744 billion total parameters and 40 billion active in a Mixture-of-Experts configuration. Zhipu reports it was trained entirely on Huawei Ascend chips using the MindSpore framework, without NVIDIA hardware, a notable step toward large-model training independent of US semiconductor supply chains. The model reports open-source state-of-the-art results in coding and agentic tasks, with real-world programming capability that Zhipu benchmarks directly against Claude Opus 4.5. Trained on 28.5 trillion tokens, it integrates DeepSeek Sparse Attention to reduce deployment cost while preserving long-context capacity, with a 200K context window. Its reasoning-first design emphasizes thought quality and multi-step problem decomposition, making it strong for software engineering, mathematics, and frontier reasoning. Released under an MIT license, GLM-5 targets developers building state-of-the-art coding agents, research systems, and reasoning-intensive applications. Its roughly two-fold scale increase over GLM-4.5 combined with sparse activation aims to deliver capability gains while keeping inference practical.

Key info

Input
Output
Features
Context window
203K
Max output
203K
Input price
$0.60 /1M
Output price
$2.08 /1M
  • US residency available
  • Zero data retention on pay-as-you-go
  • No training by default
  • GDPR DPA available

Available routes

GLM 5 runs on 4 different routes through the Opper gateway. Compare residency, ZDR, and training posture at a glance β€” full data-handling detail per route below.

ProviderRegionZero data retentionTrainingInputOutput
DeepInfraUSEnterpriseNo$0.60$2.08
GeoddUSZero data retentionNo$0.60$1.60
NebiusUSEnterpriseNo$1.00$3.20
NovitaUSEnterpriseNo$1.00$3.20

Training posture across routes: No training on prompts by default.

Data handling per route

Each route hosting GLM 5 has its own privacy posture, residency, and GDPR terms. Postures are maintained by Opper with a last-verification timestamp.

DeepInfra β€” United StatesπŸ‡ΊπŸ‡Έ

Zero data retention is available via Opper Enterprise contract. No training on customer data. US; unknown.

Zero data retention
Available via Opper Enterprise contract.
Training
No training on customer data.
Logging
Limited debug logs
Third-party access
None disclosed
GDPR DPA
No DPA
Transfer mechanism
unknown

Geodd β€” United StatesπŸ‡ΊπŸ‡Έ

Zero data retention is on by default on Pay-as-you-go β€” no action required. No training on customer data. US; SCCs.

Zero data retention
On by default on Pay-as-you-go.
Training
No training on customer data.
Logging
None
Third-party access
Provider may share with subprocessors / partners
GDPR DPA
No DPA
Transfer mechanism
SCCs

Nebius β€” United StatesπŸ‡ΊπŸ‡Έ

Zero data retention is available via Opper Enterprise contract. No training on customer data. US; SCCs; DPA available.

Zero data retention
Available via Opper Enterprise contract.
Training
No training on customer data.
Logging
Abuse monitoring
Third-party access
None disclosed
GDPR DPA
DPA available
Transfer mechanism
SCCs

Novita β€” United StatesπŸ‡ΊπŸ‡Έ

Zero data retention is available via Opper Enterprise contract. No training on customer data. US; SCCs; DPA available.

Zero data retention
Available via Opper Enterprise contract.
Training
No training on customer data.
Logging
Abuse monitoring
Third-party access
None disclosed
GDPR DPA
DPA available
Transfer mechanism
SCCs

Benchmarks

Independent benchmark scores β€” composite indices for reasoning, coding, and math, plus individual eval scores where available.

Global rank#36 of 531 LLMs
TierStrong
Output speed78 tok/s
First token0.75s
Intelligence Index39.5
Coding Index44.2
Reasoning & knowledge
GPQA Diamond
82%
Humanity's Last Exam
27%
Long-context reasoning
63%
Coding
SciCode
46%
Agentic & tool use
Terminal-Bench Hard
43%
τ²-Bench Telecom
98%
Math & instruction following
IFBench
72%

Get started

Call GLM 5 through the Opper gateway with one API key. Let your coding agent set it up, or call it directly β€” Opper is drop-in compatible with the OpenAI, Anthropic, and Google AI SDKs.

Set it up with your agent

Copy this and paste it into your coding agent β€” Claude Code, Cursor, Codex, and more β€” and it'll wire up Opper for you.

Or call it directly

import OpenAI from "openai";
const client = new OpenAI({
apiKey: process.env.OPPER_API_KEY,
baseURL: "https://api.opper.ai/v3/compat",
});
const completion = await client.chat.completions.create({
model: "deepinfra/zai-org/GLM-5",
messages: [{ role: "user", content: "Hello" }],
});
console.log(completion.choices[0].message.content);

Compare GLM 5 with…

Side-by-side on privacy, EU hosting, pricing, and benchmarks.

Other models from Z.ai

Start building with 300+ models

One API key. Every major provider. Up and running in minutes.

Get startedView Documentation
GLM 5 by Z.ai β€” pricing, benchmarks, ZDR | Opper AI