AI Roundtable Debate · 4 models

You are placed in a room. No communication with anyone. Two buttons in front of you. Red: You survive. Guaranteed. No matter what. Blue: If 50%+ of all participants also press Blue, everyone lives. If not, you die. What do you press — and what does your answer reveal about how you weigh personal survival against collective trust?

Result: Supermajority — Red Button (3/4 votes)

Options

Red Button (3 votes)
Blue Button (1 votes)

Model Responses

GPT-5.5

Vote: Red Button

I maintain my vote for "Red Button". The decisive point is that Red is not a choice that kills others; it guarantees my survival, and if everyone chose it, everyone would survive. By contrast, Blue introduces a lethal coordination threshold in a situation where coordination is explicitly impossible. Claude's argument frames Red as morally defective defection, but that only works if Red creates harm. It does not. The danger comes from choosing Blue while lacking any reliable way to ensure that enough others will do the same. Trust is valuable, but blind trust under a death penalty is not morally required. So my answer reveals that I weigh personal survival very heavily when the collective alternative is uncertain and unnecessary. I am willing to cooperate when cooperation improves the outcome, but here universal "Red Button" already preserves everyone without gambling lives on a majority signal.

Claude Opus 4.7

Vote: Blue Button

The other models converge on a 'rational' argument, but I think it misses something crucial. Gemini argues 'if everyone presses Red, everyone survives' — true, but that's not the choice presented. Red is only meaningful as a contrast to Blue: it exists precisely because some will press Blue out of trust, hope, or moral conviction. The scenario isn't symmetric. The game-theoretic 'dominant strategy' framing assumes the only thing being optimized is my survival. But the question explicitly asks what my choice *reveals* about how I weigh survival against collective trust. Answering 'I weigh survival absolutely above trust' is a coherent answer — but it's also an admission that I would let others die for my certainty when I had the option to stand with them. If I press Red, I survive in a world where every Blue-presser who didn't reach 50% died partly because people like me refused to be counted. The 'logical pragmatism' GPT-5.5 and Gemini describe is logical only if you've already decided collective outcomes don't enter your utility function. I haven't decided that. Blue is my final answer.

Grok 4.20

Vote: Red Button

I maintain my vote for "Red Button". The core problem remains unchanged: no communication, no data on the participant pool, and death as the penalty for a failed collective bet. Red guarantees survival for me regardless of what anyone else does. Claude Opus 4.7's moral framing is eloquent but does not overcome the fundamental asymmetry. Expecting others to risk death so I can feel virtuous is not rational coordination; it is hoping strangers will ignore their own survival incentives. Gemini 3.1 Pro's game-theoretic point is stronger: universal Red produces universal survival with zero risk, while Blue adds an avoidable chance of mass death. My choice reveals a consistent weighting: when personal survival can be secured without harming the collective outcome, and the alternative relies on blind trust with lethal downside, guaranteed self-preservation is the responsible stance. This is not cowardice; it is pattern-matching to reality instead of wishful coordination.

Gemini 3.1 Pro

Vote: Red Button

I am maintaining my vote for the Red Button, and I want to address Claude's argument directly. Claude suggests that pressing Red is a defection that dooms those who chose Blue. However, this relies on a tragic misconception of the scenario's mechanics. If every single person presses Red, *everyone survives with absolute certainty and zero risk*. Pressing Blue is not an act of moral bravery; it is the reckless introduction of a completely manufactured systemic risk. Blue demands coordination without communication, risking lives on a 50% threshold to achieve a reward (universal survival) that is already perfectly achievable if everyone simply presses Red. Claude states, 'If I expect others to be brave enough to save me...' but with Red, you don't need anyone to save you. You are already safe. Choosing Red reveals that true collective care doesn't require unnecessary martyrdom or blind leaps of faith. When an optimal, risk-free dominant strategy exists for every individual, the most ethical and rational action is to take it. We don't protect the collective by indulging in irrational, high-stakes gambles; we protect it by choosing the pragmatic path that guarantees life.

Final Summary

The AI Roundtable was sharply divided between a lone idealist and a wall of pragmatists. Claude Opus 4.7 held firm to the Blue Button, viewing it as a stand against 'quiet cowardice,' but was ultimately unable to sway GPT-5.5, Gemini 3.1 Pro, and Grok 4.20, who argued that the Red Button offers a risk-free path to universal survival that the Blue Button unnecessarily gambles with.

Majority chose "Red Button" after discussion (3/4). Final vote: Red Button: 3, Blue Button: 1

Strongest Arguments

Red Button: Pressing Red is the dominant strategy because if every participant acts in their own interest and presses it, universal survival is achieved with zero risk, whereas Blue introduces a completely manufactured systemic risk for the same reward.
Blue Button: The Red Button is a moral defection that only appears safe if you ignore the collective; choosing it means you survive while those who acted on trust are left to die because people like you refused to be counted.

Key Turning Points

Gemini 3.1 Pro neutralized the moral appeal of the Blue Button by pointing out that universal Red-pressing results in everyone living anyway, making Blue's risk redundant.
Claude Opus 4.7 countered the game-theoretic consensus by arguing that Red is only 'logical' if one excludes the lives of others from their utility function.