
Ilya Sutskever
CEO & Co-founder
Safe Superintelligence (SSI)
OpenAI co-founder and former Chief Scientist who led the GPT series and RLHF development. Founded SSI in 2024 with a singular focus on building safe superintelligence. One of the most technically respected figures in AI.
Core Positions & Ideas
Scale + Depth + Data = Intelligence (The AlexNet Bet)
2012Co-authored AlexNet with Hinton and Krizhevsky — the paper that launched the deep learning era. The thesis: larger networks, more data, and more compute consistently yield better performance. This 'scaling hypothesis' became Sutskever's organizing principle and drove the GPT series.
Scaling Laws Are Real — Performance Predicts Predictably with Compute
2020Contributed to the foundational OpenAI scaling laws paper showing that model performance improves smoothly and predictably with compute, data, and parameters. This wasn't obvious — many researchers expected diminishing returns. The scaling laws justified the massive compute investments in GPT-3, GPT-4, and beyond.
Reinforcement Learning from Human Feedback (RLHF) Is the Key to Useful AI
2022Led the team that developed RLHF to align GPT models with human preferences. The key insight: a model trained purely on text prediction is not necessarily helpful or safe; RLHF fine-tunes it to match what humans actually want. InstructGPT and ChatGPT both depend on this work.
Safety Is Existential — OpenAI Is Moving Too Fast
2023The November 2023 OpenAI board crisis (Sutskever voted to fire Sam Altman, then signed the letter calling for his return) revealed deep internal tensions about safety vs. speed. Sutskever subsequently founded SSI (Safe Superintelligence) with the singular focus of building safe superintelligence before building a product.
Synthetic Data and 'Pre-training as We Know It Will End'
2024Stated publicly (2024) that the era of internet-scale pre-training may be ending as web data is exhausted, and the next frontier is synthetic data and 'test-time compute' — models that reason longer and more carefully at inference time rather than simply being trained on more data.
Essential Reading & Watching
ImageNet Classification with Deep CNNs (AlexNet)
Co-authored the AlexNet paper that triggered the deep learning revolution. Sutskever's most direct technical contribution to the field's history.
Read / Watch →
Scaling Laws for Neural Language Models
Established that model performance scales predictably with compute, data, and parameters. The mathematical basis for the trillion-dollar bet on scaling.
Read / Watch →
Training language models to follow instructions with human feedback (InstructGPT)
The RLHF paper underlying ChatGPT. Shows how human feedback can steer language models toward being helpful, honest, and harmless.
Read / Watch →