Gemma 2 2B IT

by Google

Gemma 2 2B IT is Google's instruction-tuned variant of the 2.6 billion parameter Gemma 2 base model, built from the research and technology behind the Gemini family. As a decoder-only transformer using Grouped-Query Attention (GQA), rotary positional embeddings (RoPE), and GeGLU activations, it balances capability with efficiency for deployment on laptops, desktops, and modest cloud infrastructure. The model is fine-tuned through supervised instruction-following and reinforcement learning from human feedback (RLHF) to align outputs with human preferences. It handles question answering, summarization, and reasoning tasks while keeping a minimal resource footprint, within an 8K token context. With open weights freely available, Gemma 2 2B IT lets developers fine-tune and deploy their own specialized versions without relying on a hosted API, which suits edge deployment and privacy-sensitive applications.

Key info

Input
Output
Features
Context window
8K
Max output
8K

Available routes

No routes currently available — Gemma 2 2B IT isn't routed through the Opper gateway right now. It may return.

Contact us about this model →

Available models from Google

Start building with 300+ models

One API key. Every major provider. Up and running in minutes.

Get startedView Documentation
Gemma 2 2B IT by Google — not currently on Opper | Opper AI