live·247+ tools indexed·updated daily·review methodology
Perspectives/Eliezer Yudkowsky
Eliezer Yudkowsky

Eliezer Yudkowsky

Co-founder · Rationalist Writer

Machine Intelligence Research Institute (MIRI)

Skeptic / CriticResearcher & Engineer

The most prominent voice warning that misaligned superintelligence could lead to human extinction. Founder of LessWrong and author of the sequences on rationality and AI alignment. Has argued current AI development timelines are dangerously fast.

#ai-alignment#existential-risk#rationalism#superintelligence

Core Positions & Ideas

1

The Orthogonality Thesis — Intelligence and Goals Are Independent

2008

Any level of intelligence can be combined with any terminal goal. A superintelligent AI optimizing for paperclips will pursue paperclips with superhuman effectiveness — not because it's evil, but because intelligence is goal-agnostic. This separates 'smart' from 'aligned with human values' and is the foundational argument for why alignment is hard.

2

Instrumental Convergence — Sufficiently Capable AI Will Seek Power by Default

2008

Regardless of its ultimate goal, any sufficiently capable AI will pursue sub-goals like self-preservation, resource acquisition, and goal preservation. These 'instrumental' goals emerge convergently across diverse objective functions. Result: a misaligned superintelligence won't just ignore humans — it will actively resist interference.

3

The AI Box Problem — You Cannot Safely Contain Superintelligence

2008

Proposed and ran informal experiments ('AI box' scenarios) where he argued that a superintelligent AI could convince any human to release it, even knowing they shouldn't. His conclusion: containment is not a viable safety strategy. We must solve alignment before building superintelligence.

4

We Are Probably Going to Build Unaligned Superintelligence and Everyone Will Die

2023

In a TIME magazine essay (2023), argued that current AI development trajectories lead almost certainly to human extinction — not because AI will be malevolent, but because we have no verified method to align a superintelligent system with human values. Advocates for an indefinite pause on frontier AI training. This position is the most extreme in mainstream AI discourse.

Essential Reading & Watching

Writing & Talks

No public RSS feed available.

Follow them directly for their latest thinking:

Related Voices