Gemini 2.0 Flash

by Google

Gemini 2.0 Flash is a fast multimodal Google model that takes text, images, video, audio, and PDF input. It is tuned for speed and efficiency, which makes it a fit for high-volume workloads and interactive applications where response time matters. The model supports tool use and structured output for programmatic control flow. With a 1M token context window, it handles long documents, extended conversation history, and complex multi-turn interactions, and it performs well on visual understanding, video analysis, and other multimodal reasoning tasks.

Key info

Context window: 1M tokens
Max output: 8K tokens
Input price: $0.15 per 1M tokens
Output price: $0.60 per 1M tokens
  • EU residency available
  • Zero data retention via Enterprise
  • No training by default
  • GDPR DPA available
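
The per-token prices listed under Key info translate into a per-request cost with simple arithmetic. The sketch below is illustrative only: `estimateCostUsd` is a hypothetical helper, not part of any Opper or Google SDK, and it assumes the listed prices apply uniformly to all input and output tokens.

```typescript
// Prices from the Key info figures (USD per 1M tokens).
const INPUT_PRICE_PER_MILLION = 0.15;
const OUTPUT_PRICE_PER_MILLION = 0.60;

// Hypothetical helper: estimates the USD cost of a single request.
// Assumes flat pricing with no caching or volume discounts.
function estimateCostUsd(inputTokens: number, outputTokens: number): number {
  if (inputTokens < 0 || outputTokens < 0) {
    throw new RangeError("token counts must be non-negative");
  }
  return (
    (inputTokens / 1_000_000) * INPUT_PRICE_PER_MILLION +
    (outputTokens / 1_000_000) * OUTPUT_PRICE_PER_MILLION
  );
}

// Example: a 200,000-token document summarized into 4,000 tokens.
console.log(estimateCostUsd(200_000, 4_000).toFixed(4)); // prints 0.0324
```

At these rates, input tokens dominate the bill for long-document workloads, since output is capped at roughly 8K tokens per response.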

Available routes

Gemini 2.0 Flash runs on 1 route through the Opper gateway. Compare residency, zero data retention (ZDR), and training posture at a glance; full data-handling detail per route is below.

Provider      Region  Zero data retention  Training  Input  Output
Google Cloud  EU      Enterprise           No        $0.15  $0.60

Training posture across routes: No training on prompts by default.
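
When an application has to enforce residency or training requirements in code, the route table can be modeled as plain data and filtered. This is a minimal sketch: the `Route` interface, its field names, and `selectRoutes` are assumptions made for illustration and do not reflect an actual Opper API. The single entry mirrors the one route listed for this model.

```typescript
// Hypothetical shape for one row of the routes table; field names are illustrative.
interface Route {
  provider: string;
  region: string;
  zeroDataRetention: string; // e.g. "Enterprise" means available via Enterprise contract
  trainsOnPrompts: boolean;
  inputPricePerMillion: number; // USD
  outputPricePerMillion: number; // USD
}

// The one route listed for Gemini 2.0 Flash.
const routes: Route[] = [
  {
    provider: "Google Cloud",
    region: "EU",
    zeroDataRetention: "Enterprise",
    trainsOnPrompts: false,
    inputPricePerMillion: 0.15,
    outputPricePerMillion: 0.60,
  },
];

// Keeps only routes in the requested region that do not train on prompts.
function selectRoutes(all: Route[], region: string): Route[] {
  return all.filter((r) => r.region === region && !r.trainsOnPrompts);
}

console.log(selectRoutes(routes, "EU").map((r) => r.provider));
```

An empty result from `selectRoutes` signals that no listed route satisfies the requirement, which is safer to handle explicitly than to fall back to a default route.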

Data handling per route

Each route hosting Gemini 2.0 Flash has its own privacy posture, residency, and GDPR terms. Postures are maintained by Opper with a last-verification timestamp.

Google Cloud, European Union 🇪🇺

Zero data retention is available via an Opper Enterprise contract. There is no training on customer data. Data is hosted in the EU, and a GDPR DPA is available.

Zero data retention: Available via Opper Enterprise contract.
Training: No training on customer data.
Logging: Abuse monitoring
Third-party access: None disclosed
GDPR DPA: DPA available
Transfer mechanism: Not applicable; data stays in the EU

Benchmarks

Independent benchmark scores: composite indices for reasoning, coding, and math, plus individual eval scores where available.

Global rank: #287 of 531 LLMs
Tier: Efficient
Output speed: 0 tok/s
First token: 0.00s
Intelligence Index: 12.3
Coding Index: 13.6
Math Index: 21.7

Reasoning & knowledge
  • MMLU-Pro: 78%
  • GPQA Diamond: 62%
  • Humanity's Last Exam: 5%
  • Long-context reasoning: 28%

Coding
  • LiveCodeBench: 33%
  • SciCode: 33%

Agentic & tool use
  • Terminal-Bench Hard: 4%
  • τ²-Bench Telecom: 30%

Math & instruction following
  • AIME 2025: 22%
  • IFBench: 40%

Get started

Call Gemini 2.0 Flash through the Opper gateway with one API key. Let your coding agent set it up, or call it directly; Opper is drop-in compatible with the OpenAI, Anthropic, and Google AI SDKs.

Set it up with your agent

Copy this and paste it into your coding agent (Claude Code, Cursor, Codex, and more) and it'll wire up Opper for you.

Or call it directly

import OpenAI from "openai";

// Point the OpenAI SDK at the Opper compatibility endpoint.
const client = new OpenAI({
  apiKey: process.env.OPPER_API_KEY,
  baseURL: "https://api.opper.ai/v3/compat",
});

// Send a single-turn chat request to Gemini 2.0 Flash.
const completion = await client.chat.completions.create({
  model: "vertexai/gemini-2.0-flash",
  messages: [{ role: "user", content: "Hello" }],
});

// Print the text of the first choice.
console.log(completion.choices[0].message.content);
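
The model description mentions tool use. The sketch below shows roughly what a tool-enabled request and its response handling could look like in the OpenAI Chat Completions shape. It rests on two assumptions: that the compatibility endpoint passes the `tools` field through to the model, which this page does not confirm, and that responses follow the standard OpenAI `tool_calls` layout. The `get_weather` tool, `parseToolCalls`, and the mocked message are hypothetical, and no network call is made.

```typescript
// Request body in the OpenAI Chat Completions shape, with one tool attached.
// `get_weather` and its parameters are hypothetical.
const requestBody = {
  model: "vertexai/gemini-2.0-flash",
  messages: [{ role: "user", content: "What is the weather in Stockholm?" }],
  tools: [
    {
      type: "function",
      function: {
        name: "get_weather",
        description: "Look up the current weather for a city.",
        parameters: {
          type: "object",
          properties: { city: { type: "string" } },
          required: ["city"],
        },
      },
    },
  ],
};

// Minimal types for the part of the response this sketch reads.
interface ToolCall {
  function: { name: string; arguments: string };
}
interface AssistantMessage {
  content: string | null;
  tool_calls?: ToolCall[];
}
interface ParsedCall {
  name: string;
  args: unknown;
  error?: string;
}

// Decodes each tool call's JSON-encoded arguments.
// Malformed JSON is reported in `error` rather than thrown.
function parseToolCalls(message: AssistantMessage): ParsedCall[] {
  return (message.tool_calls ?? []).map((call): ParsedCall => {
    try {
      return { name: call.function.name, args: JSON.parse(call.function.arguments) };
    } catch {
      return { name: call.function.name, args: null, error: "invalid JSON arguments" };
    }
  });
}

// Mocked assistant message, standing in for completion.choices[0].message.
const mockMessage: AssistantMessage = {
  content: null,
  tool_calls: [{ function: { name: "get_weather", arguments: '{"city":"Stockholm"}' } }],
};

console.log(requestBody.tools[0].function.name, parseToolCalls(mockMessage));
```

In a real call, `requestBody` would be passed to `client.chat.completions.create`, and each parsed call would be dispatched to application code before the result is sent back in a follow-up message.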
