Llama-3.1-8B-Instruct

by Meta

Llama 3.1 8B Instruct is Meta's production-grade lightweight model, released in July 2024. With 8 billion parameters and a 131,072-token context window, it offers a strong cost-to-capability ratio, combining solid instruction following and reasoning in a compact footprint. The model supports tool calling and structured output, enabling agentic workflows and programmatic integration without moving to a larger model. It handles general-purpose tasks such as summarization, question answering, code generation, and classification with competitive quality for its size. Its open weights and multilingual training make it suitable for fine-tuning and on-premise deployment. It is a common choice where compute is limited but the workload still calls for more than simple text generation.

Key info

Context window: not listed
Max output: not listed
Input price: $0.22 per 1M tokens
Output price: $0.22 per 1M tokens
  • EU residency available
  • US residency available
  • Zero data retention on pay-as-you-go
  • No training by default
  • GDPR DPA available

Available routes

Llama-3.1-8B-Instruct runs on three routes through the Opper gateway. Compare residency, ZDR, and training posture at a glance; full data-handling detail per route is below.

Provider    Region   Zero data retention   Training   Input    Output
Berget      EU       Zero data retention   No         $0.22    $0.22
DeepInfra   US       Enterprise            No         $0.02    $0.05
Novita      US       Enterprise            No         $0.02    $0.05
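
To make the price gap between routes concrete, here is a minimal sketch that estimates workload cost per route. It assumes the table prices are USD per 1M tokens (the unit shown under Key info); `ROUTE_PRICES` and `estimateCostUsd` are hypothetical names for illustration, not part of any Opper API.

```javascript
// Per-route prices, assumed to be USD per 1M tokens, copied from the routes table.
const ROUTE_PRICES = new Map([
  ["Berget", { input: 0.22, output: 0.22 }],
  ["DeepInfra", { input: 0.02, output: 0.05 }],
  ["Novita", { input: 0.02, output: 0.05 }],
]);

// Hypothetical helper: estimated USD cost of a workload on one route.
function estimateCostUsd(route, inputTokens, outputTokens) {
  const price = ROUTE_PRICES.get(route);
  if (!price) throw new Error(`Unknown route: ${route}`);
  return (inputTokens / 1e6) * price.input + (outputTokens / 1e6) * price.output;
}

// Example workload: 2M input tokens and 0.5M output tokens.
for (const route of ROUTE_PRICES.keys()) {
  console.log(route, estimateCostUsd(route, 2_000_000, 500_000).toFixed(3));
}
```

For this example workload, the Berget route comes to about $0.55 and each US route to about $0.065, so the EU route's default zero data retention carries a noticeable price premium.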

Training posture across routes: No training on prompts by default.

Data handling per route

Each route hosting Llama-3.1-8B-Instruct has its own privacy posture, residency, and GDPR terms. Postures are maintained by Opper with a last-verification timestamp.

Berget, Sweden 🇸🇪

Zero data retention is on by default on Pay-as-you-go, with no action required. No training on customer data. EU; DPA available.

Zero data retention: On by default on Pay-as-you-go.
Training: No training on customer data.
Logging: None
Third-party access: None disclosed
GDPR DPA: DPA available
Transfer mechanism: Not applicable (data stays in EU)

DeepInfra, United States 🇺🇸

Zero data retention is available via Opper Enterprise contract. No training on customer data. US; no DPA; transfer mechanism unknown.

Zero data retention: Available via Opper Enterprise contract.
Training: No training on customer data.
Logging: Limited debug logs
Third-party access: None disclosed
GDPR DPA: No DPA
Transfer mechanism: Unknown

Novita, United States 🇺🇸

Zero data retention is available via Opper Enterprise contract. No training on customer data. US; SCCs; DPA available.

Zero data retention: Available via Opper Enterprise contract.
Training: No training on customer data.
Logging: Abuse monitoring
Third-party access: None disclosed
GDPR DPA: DPA available
Transfer mechanism: SCCs
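
The per-route postures above can be treated as data and filtered against compliance requirements. The sketch below transcribes the three routes into plain objects; the field names (`zdr`, `dpa`, `transfer`, `logging`) and the `findRoutes` helper are hypothetical, chosen for illustration rather than taken from an Opper API.

```javascript
// Route postures transcribed from the per-route details above.
const ROUTES = [
  { provider: "Berget", region: "EU", zdr: "default", dpa: true, transfer: "not applicable", logging: "none" },
  { provider: "DeepInfra", region: "US", zdr: "enterprise", dpa: false, transfer: "unknown", logging: "limited debug logs" },
  { provider: "Novita", region: "US", zdr: "enterprise", dpa: true, transfer: "SCCs", logging: "abuse monitoring" },
];

// Hypothetical helper: names of routes that satisfy every stated requirement.
function findRoutes({ region, requireDpa = false, requireDefaultZdr = false } = {}) {
  return ROUTES.filter(
    (r) =>
      (region === undefined || r.region === region) &&
      (!requireDpa || r.dpa) &&
      (!requireDefaultZdr || r.zdr === "default")
  ).map((r) => r.provider);
}

console.log(findRoutes({ requireDpa: true })); // Berget and Novita
console.log(findRoutes({ region: "EU", requireDefaultZdr: true })); // Berget only
```

Because postures are re-verified over time, a hard-coded list like this would need to be refreshed against the current route details rather than treated as authoritative.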

Get started

Call Llama-3.1-8B-Instruct through the Opper gateway with one API key. Let your coding agent set it up, or call it directly: Opper is drop-in compatible with the OpenAI, Anthropic, and Google AI SDKs.

Set it up with your agent

Copy this and paste it into your coding agent (Claude Code, Cursor, Codex, and more) and it'll wire up Opper for you.

Or call it directly

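// Requires the "openai" npm package, OPPER_API_KEY in the environment,
// and an ES module context (top-level await is used below).
import OpenAI from "openai";

// Point the OpenAI SDK at Opper's OpenAI-compatible endpoint.
const client = new OpenAI({
  apiKey: process.env.OPPER_API_KEY,
  baseURL: "https://api.opper.ai/v3/compat",
});

// The "berget/" prefix in the model ID matches the Berget (EU) route listed above.
const completion = await client.chat.completions.create({
  model: "berget/meta-llama/Llama-3.1-8B-Instruct",
  messages: [{ role: "user", content: "Hello" }],
});

// Print the assistant's reply text.
console.log(completion.choices[0].message.content);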
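
The overview notes that the model supports tool calling. As a rough sketch of what that could look like in the OpenAI chat-completions format, the code below builds a request body that declares one tool and converts the model's tool calls into follow-up "tool" messages. The `get_weather` tool, `buildToolRequest`, and `runToolCalls` are hypothetical names, the response is mocked so the example runs offline, and whether the Opper compat endpoint forwards the `tools` field on this route is an assumption to verify.

```javascript
// Hypothetical local tool; a real one would query a weather service.
function getWeather({ city }) {
  return { city, forecast: "sunny" };
}
const LOCAL_TOOLS = new Map([["get_weather", getWeather]]);

// Request body in the OpenAI chat-completions format, with one tool declared.
function buildToolRequest(userText) {
  return {
    model: "berget/meta-llama/Llama-3.1-8B-Instruct",
    messages: [{ role: "user", content: userText }],
    tools: [
      {
        type: "function",
        function: {
          name: "get_weather",
          description: "Look up the weather forecast for a city.",
          parameters: {
            type: "object",
            properties: { city: { type: "string" } },
            required: ["city"],
          },
        },
      },
    ],
  };
}

// Turn the assistant's tool calls into "tool" messages for the follow-up request.
function runToolCalls(assistantMessage) {
  return (assistantMessage.tool_calls ?? []).map((call) => {
    const tool = LOCAL_TOOLS.get(call.function.name);
    const result = tool
      ? tool(JSON.parse(call.function.arguments))
      : { error: `unknown tool ${call.function.name}` };
    return { role: "tool", tool_call_id: call.id, content: JSON.stringify(result) };
  });
}

// Mocked assistant message in the shape the OpenAI format uses for tool calls.
const mockMessage = {
  role: "assistant",
  content: null,
  tool_calls: [
    { id: "call_1", type: "function", function: { name: "get_weather", arguments: '{"city":"Stockholm"}' } },
  ],
};
console.log(runToolCalls(mockMessage));
```

In a live setup, the object from `buildToolRequest` would be passed to `client.chat.completions.create(...)`, and the messages from `runToolCalls` would be appended after the assistant message in a second request so the model can produce its final answer.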
