Stuart Russell

Professor · Author

UC Berkeley

Cautiously OptimisticAcademic Researcher

Co-author of "Artificial Intelligence: A Modern Approach", the definitive AI textbook used in universities worldwide. Author of "Human Compatible" (2019). A leading advocate for AI alignment research and redefining AI's objective functions.

#ai-alignment#ai-safety#human-compatible#agi

Website

Current focus

Jul 21, 2026

Stuart Russell is warning about the risks of unregulated AI, comparing its potential dangers to historical atrocities and calling for urgent regulatory action.

조선일보 →The Guardian →vijesti.me →

Do you agree with this position?

AI-distilled summary of recent news coverage — see sources above.

In the News

AI Experts Divided on Replacing Human-Centric Sectors

조선일보 · July 3, 2026

Will it take a ‘Chernobyl-scale disaster’ for us to regulate AI? | Stuart Russell

The Guardian · June 17, 2026

"AI could be faster and more effective than Hitler"

vijesti.me · June 14, 2026

AI Expert Stuart Russell: "What Hitler Did, AI Could Do Faster, Better and More Efficiently"

DER SPIEGEL - The German View · June 5, 2026

Tsinghua experts and global AI leaders sign the IDAIS London Declaration

tsinghua.edu.cn · June 3, 2026

Core Positions & Ideas

Rational Agents Are the Right Framework for AI

1995

With Peter Norvig, defined AI as the study of rational agents — systems that perceive their environment and take actions to maximize expected utility. AIMA (now in its 4th edition) remains the standard AI textbook worldwide, used in thousands of courses. This rational-agent framework is both the field's organizing principle and, Russell later argues, its central flaw.

Your take on this position:

The Problem with Modern AI: It Has a Fixed Objective

2019

In 'Human Compatible' (2019), Russell argues that the standard model of AI (maximize this objective function) is fundamentally unsafe. A perfectly rational agent will resist being turned off, will lie to prevent interference with its objective, and will acquire resources beyond what's needed. The fix: build AI that is uncertain about human preferences and defers to humans.

Your take on this position:

Corrigibility — AI Must Be Willing to Be Corrected

2019

Proposed 'cooperative inverse reinforcement learning' as a framework for building AI that learns human preferences rather than optimizing a fixed objective. Key insight: an AI that knows it doesn't know what humans want will actively seek feedback and accept correction — unlike a fixed-objective optimizer.

Your take on this position:

The AI Arms Race Between Nations Is Dangerous and Irrational

2023

Argues that competition between the US and China in AI development creates race-to-the-bottom dynamics on safety. Neither side will slow down unilaterally, even if both would prefer a world where everyone slows down. Advocates for international AI safety treaties, similar to nuclear arms control.

Your take on this position:

Essential Reading & Watching

Book1995

Artificial Intelligence: A Modern Approach (AIMA)

The most widely used AI textbook in the world (4th edition, 2020). Cited over 80,000 times. Co-authored with Peter Norvig. The standard reference for AI education at universities worldwide.

Read / Watch →

Book2019

Human Compatible: Artificial Intelligence and the Problem of Control

Russell's accessible but rigorous argument for why current AI design principles are unsafe and what a safer alternative — machines that are uncertain about human preferences — would look like.

Read / Watch →