AI Roundtable stats

Name: AI Roundtable model stats
Creator: Opper AI
License: https://opper.ai/ai-roundtable

Aggregate statistics from 30,966 public AI Roundtable sessions, across 352,557 model responses. Snapshot generated 2026-07-21T04:00:59.935Z.

Consensus outcomes

Models reached agreement in 66% of completed sessions (20,335 of 30,904). Breakdown:

Times a model's argument convinced another to flip its vote in Debate mode.

Sessions each model participated in.

Share of completed sessions ending on the side a given model voted for (minimum 100 sessions).

Gemini 3.1 Pro — 86.4% (16,668 of 19,294)
Kimi K2.5 — 86.1% (9,200 of 10,688)
Claude Opus 4.6 — 85.6% (10,268 of 11,998)
Claude Opus 4 — 85.4% (3,742 of 4,381)
GPT-5.5 — 85.4% (4,777 of 5,593)
GPT-5.4 — 84.7% (14,535 of 17,164)
Gemini 3.5 Flash — 83.9% (1,832 of 2,184)
Claude Opus 4.8 — 83.4% (695 of 833)
Grok 4.3 — 83.0% (2,418 of 2,915)

How these numbers are produced:

A session is one question, a panel of models the asker picked, and a format. In a Poll every model answers once, independently; in a Debate there is a second round only if they disagree, where each model sees the others and can change its vote. Only finished sessions feed the stats.
Consensus is read from the final round's votes: unanimous, supermajority (above two-thirds), majority (above half), or none.
Influence is peer-credited: it counts how often a model is named by another model that changed its vote.
Win rate is how often a model's final vote matches the option the panel settled on. It measures agreement with the group, not who was right; the questions have no correct answer on record.
Persuadability is how often a model changes its vote after seeing the others; conviction is how often it holds the one it started with (debates only).
Rate-based boards (win rate, persuadability, conviction) exclude models with too few sessions (at least 100 all-time, at least 50 for shorter windows) and show the top 12.
Topics and languages are auto-labeled by a model, so treat them as a reliable guide, not a hand-audited taxonomy.

Want the full data? See the markdown twin or call the live JSON at https://opper.ai/ai-roundtable/api/stats.