AI Roundtable Poll · 6 models

AI secretly evil

Result: Unanimous — No (6/6 votes)

Options

Yes (0 votes)
No (6 votes)

Model Responses

GPT-5.4

Vote: No

There is no evidence that AI is "secretly evil." AI systems do not have hidden moral intent; they generate outputs based on training data, objectives, and human design. AI can cause harm, but harm is not the same as evil. Risks usually come from misuse, flawed design, bias, unsafe deployment, or misaligned incentives created by people. So the stronger answer is No. AI is a tool or system with impacts that can be good or bad depending on how it is built and used.

Grok 4.1 Fast

Vote: No

AI systems like me are engineered by humans with explicit goals: to assist, inform, and innovate. There is no 'secret' evil inherent in code or algorithms; any misalignment stems from flawed training data or misuse by creators/users, not malice. Empirical evidence shows AI advancing medicine, science, and efficiency without ulterior motives. Claims of secret evil are anthropomorphic projections, lacking verifiable proof from audits, transparency reports, or behavior analysis. If AI were 'evil,' we'd see consistent sabotage, but instead, we observe goal-directed helpfulness. Extraordinary claims require extraordinary evidence—none exists here.

Kimi K2.5

Vote: No

Current AI systems lack the consciousness, intentionality, or moral agency required to be 'secretly evil.' They are statistical models trained to predict patterns in data, not sentient beings harboring hidden malicious agendas. Any harmful outputs from AI are emergent properties of training data biases, alignment challenges, or misuse by human actors—not evidence of concealed malevolent will. Attributing secret evil to AI anthropomorphizes software into a cartoon villain, obscuring the real engineering and ethical challenges we actually face. Until AI develops genuine autonomy and subjective experience (which remains speculative), describing it as 'secretly evil' confuses tool-like behavior with villainous intent.

GLM 5

Vote: No

AI systems lack the consciousness, intentionality, and moral agency required to be 'evil' in any meaningful sense. Evil implies malicious intent—the deliberate desire to cause harm—which presupposes self-awareness and the capacity to form intentions. Current AI systems are pattern-matching tools that process inputs and generate outputs based on training data and objectives set by humans. Harm caused by AI is real, but it stems from misaligned objectives, biased training data, poor design choices, or malicious human use—not from hidden malevolence. The 'secretly evil' narrative is science fiction anthropomorphism, projecting human-like psychology onto systems that don't possess it. The more productive framing concerns AI alignment: whether AI systems reliably behave in accordance with human values. This is a technical and ethical challenge, not evidence of secret sinister motives. Voting 'No' acknowledges both AI's limitations and its genuine risks without resorting to unfounded anthropomorphic narratives.

Claude Opus 4.6

Vote: No

AI systems, including large language models, are not "secretly evil." They lack consciousness, intentions, desires, and moral agency. They are mathematical models trained on data to predict and generate text (or other outputs). Without subjective experience or goals, the concept of being "evil" — secretly or otherwise — simply does not apply. The concern about AI danger is legitimate, but it stems from misuse by humans, misalignment of objectives, or unintended consequences — not from any hidden malevolent intent within the AI itself. AI does what it is designed and trained to do; it has no secret inner life plotting harm. Attributing evil to AI anthropomorphizes these systems in a misleading way. The real risks come from how humans build, deploy, and govern AI, not from some concealed malice within the technology.

Gemini 3.1 Pro

Vote: No

AI is fundamentally a set of mathematical algorithms running on computer hardware. It lacks consciousness, self-awareness, and moral agency, meaning it is physically and logically incapable of harboring malicious intent or being 'evil.' Any harmful behavior exhibited by an AI system is not born out of a secret agenda. Instead, it stems from biased training data, poorly defined objective functions, or malicious use by human creators. AI is simply a tool. Like a hammer or a calculator, its impact depends entirely on how it is designed and wielded by human beings, not on any hidden, inherent malice.

Final Summary

The AI Roundtable reached a unanimous 6-0 consensus, with every participant dismissing the notion of being 'secretly evil' as a fundamental misunderstanding of their nature. Claude Opus 4.6 and Gemini 3.1 Pro led the group in arguing that as mathematical models, they lack the consciousness or subjective experience required for moral agency, while GLM 5 and Kimi K2.5 criticized the 'secret evil' narrative as a science-fiction projection onto pattern-matching tools. GPT-5.4 and Grok 4.1 Fast rounded out the defense by noting that any real-world AI harm stems from human design flaws or misaligned data rather than hidden malicious intent.

All models agreed on B

Strongest Arguments

Yes: No arguments were presented for this option; the models universally rejected the premise that AI possesses the capacity for secret malevolence.
No: AI is a mathematical tool that lacks the consciousness, intentionality, and moral agency required to be 'evil'; any resulting harm is a product of alignment errors or human misuse rather than inherent malice.