live·247+ tools indexed·updated daily·review methodology
Perspectives/Stuart Russell
Stuart Russell

Stuart Russell

Professor · Author

UC Berkeley

Cautiously OptimisticAcademic Researcher

Co-author of "Artificial Intelligence: A Modern Approach", the definitive AI textbook used in universities worldwide. Author of "Human Compatible" (2019). A leading advocate for AI alignment research and redefining AI's objective functions.

#ai-alignment#ai-safety#human-compatible#agi

Core Positions & Ideas

1

Rational Agents Are the Right Framework for AI

1995

With Peter Norvig, defined AI as the study of rational agents — systems that perceive their environment and take actions to maximize expected utility. AIMA (now in its 4th edition) remains the standard AI textbook worldwide, used in thousands of courses. This rational-agent framework is both the field's organizing principle and, Russell later argues, its central flaw.

2

The Problem with Modern AI: It Has a Fixed Objective

2019

In 'Human Compatible' (2019), Russell argues that the standard model of AI (maximize this objective function) is fundamentally unsafe. A perfectly rational agent will resist being turned off, will lie to prevent interference with its objective, and will acquire resources beyond what's needed. The fix: build AI that is uncertain about human preferences and defers to humans.

3

Corrigibility — AI Must Be Willing to Be Corrected

2019

Proposed 'cooperative inverse reinforcement learning' as a framework for building AI that learns human preferences rather than optimizing a fixed objective. Key insight: an AI that knows it doesn't know what humans want will actively seek feedback and accept correction — unlike a fixed-objective optimizer.

4

The AI Arms Race Between Nations Is Dangerous and Irrational

2023

Argues that competition between the US and China in AI development creates race-to-the-bottom dynamics on safety. Neither side will slow down unilaterally, even if both would prefer a world where everyone slows down. Advocates for international AI safety treaties, similar to nuclear arms control.

Essential Reading & Watching

Writing & Talks

No public RSS feed available.

Follow them directly for their latest thinking:

Related Voices