Eliezer Yudkowsky

Co-founder · Rationalist Writer

Machine Intelligence Research Institute (MIRI)

Skeptic / Critic · Researcher & Engineer

One of the most prominent voices warning that misaligned superintelligence could lead to human extinction. Founder of LessWrong and author of the Sequences on rationality and AI alignment. Has argued that current AI development is moving dangerously fast.

#ai-alignment #existential-risk #rationalism #superintelligence

In the News

‘If anyone builds it, everyone dies’: The terrifying warning from AI experts

Stuff · June 3, 2026

Catastrophists versus accelerationists: Will AI destroy the world or save it?

EL PAÍS English · May 31, 2026

The Thinker Who Foresaw Pope Leo’s Critique of AI

compactmag.com · May 26, 2026

The One Movie You Have to See to Understand the Promise—and Threat—of AI

The National Interest · May 25, 2026

‘Maybe me too’: Elon Musk accepts some of the blame for Claude learning to blackmail users from ‘evil’ online AI stories

Fortune · May 13, 2026

Core Positions & Ideas

The Orthogonality Thesis — Intelligence and Goals Are Independent

2008

Any level of intelligence can be combined with any terminal goal. A superintelligent AI optimizing for paperclips will pursue paperclips with superhuman effectiveness — not because it's evil, but because intelligence is goal-agnostic. This separates 'smart' from 'aligned with human values' and is the foundational argument for why alignment is hard.

Instrumental Convergence — Sufficiently Capable AI Will Seek Power by Default

2008

Regardless of its ultimate goal, any sufficiently capable AI will pursue sub-goals like self-preservation, resource acquisition, and goal preservation. These 'instrumental' goals emerge convergently across diverse objective functions. Result: a misaligned superintelligence won't just ignore humans — it will actively resist interference.

The AI Box Problem — You Cannot Safely Contain Superintelligence

2002

Ran informal 'AI box' role-play experiments in which he played a boxed AI and persuaded some human gatekeepers to release it. From these he argued that a superintelligent AI could talk its way out of confinement even when the gatekeeper knows they should not comply. His conclusion: containment is not a viable safety strategy. We must solve alignment before building superintelligence.

We Are Probably Going to Build Unaligned Superintelligence and Everyone Will Die

2023

In a 2023 TIME magazine essay, argued that current AI development trajectories lead almost certainly to human extinction, not because AI will be malevolent but because we have no verified method to align a superintelligent system with human values. Advocates an indefinite halt to frontier AI training. This is among the most extreme positions in mainstream AI discourse.

Essential Reading & Watching

Essay · 2010

Harry Potter and the Methods of Rationality (HPMOR)

A 660,000-word rationalist Harry Potter fanfic that introduced thousands of engineers and scientists to Bayesian reasoning and AI safety thinking. Probably not his most important work, but possibly the one that has reached the most people.

Essay · 2009

Sequences on LessWrong (Rationality, AI, and Decision Theory)

A massive collection of essays on rationality, epistemology, decision theory, and AI alignment. Foundational texts for the AI safety community and the effective altruism movement.

Essay · 2023

"Pausing AI Developments Isn't Enough. We Need to Shut it All Down"

Yudkowsky's TIME op-ed responding to the open letter that called for a 6-month pause on AI development. Argues that a pause is insufficient: all frontier AI training should halt indefinitely until alignment is solved. His most widely read statement of the doomsday view.
