François Chollet
AI Researcher · Creator of Keras
Google · ARC Prize Foundation
Creator of Keras (the most widely-used deep learning API) and the ARC-AGI benchmark, which tests abstract reasoning abilities. Argues that current LLMs lack systematic generalization and that true intelligence requires efficient program synthesis.
Core Positions & Ideas
Keras — Deep Learning Should Be Accessible to Everyone
2015Created Keras as a high-level, user-friendly API for building neural networks, explicitly designed to be accessible to non-specialists. His philosophy: the most important factor in AI progress is the number of people who can run experiments. Keras accelerated deep learning adoption by making it approachable.
The ARC-AGI Benchmark — LLMs Cannot Generalize Like Humans
2019Created the Abstraction and Reasoning Corpus (ARC) — a set of visual pattern completion tasks trivially easy for humans but consistently difficult for state-of-the-art AI systems. His thesis: ARC exposes the gap between 'crystallized' skill (LLMs excel here) and 'fluid intelligence' (generalizing to novel situations with few examples).
Intelligence Is Efficient Program Synthesis — Not Pattern Matching
2019In 'On the Measure of Intelligence' (2019), Chollet proposed defining intelligence as the ability to acquire new skills efficiently given limited prior knowledge and data. By this measure, LLMs score poorly: they require massive data to learn each skill and fail to generalize across slight variations. True intelligence requires learning programs, not patterns.
Skill Is Not Intelligence — Current AI Systems Are Giant Skill Libraries
2023Argues that LLMs are essentially compressed databases of human skills, retrieved and remixed on demand. This is useful but not intelligence. Intelligence is what happens when you face a situation your training distribution doesn't cover. LLMs fail systematically in those situations — which is exactly what ARC-AGI is designed to test.
Test-Time Compute May Be the Key — ARC Progress Shows This
2024After OpenAI's o3 model showed dramatic ARC-AGI improvements in late 2024, Chollet updated his view: test-time compute (letting models 'think longer' via chain-of-thought and search) may bridge the gap between skill retrieval and reasoning. He moved ARC-AGI 2 goalposts: o3 solved v1, but new harder tasks remain.
Essential Reading & Watching
On the Measure of Intelligence
Chollet's most important paper. Proposes a formal definition of intelligence grounded in Kolmogorov complexity and introduces ARC as a benchmark aligned with this definition. Required reading for anyone thinking seriously about AI evaluation.
Read / Watch →
ARC Prize — $1M Competition for Human-Level ARC Performance
A competition with $1M in prizes for AI systems that can solve ARC tasks at human-level performance. Designed to expose the limits of current LLMs and incentivize research into genuine generalization.
Read / Watch →
Deep Learning with Python
The definitive practical guide to Keras and deep learning. One of the bestselling ML books. Second edition (2021) updated for Keras 2 and TensorFlow 2. Accessible to beginners while technically rigorous.
Read / Watch →
Recent Writing (6)
How I think about LLM prompt engineering
Prompting as searching through a space of vector programs
October 9, 2023Read original →
The loop of progress
The best ideas are born from years of iterative refinement.
December 6, 2022Read original →
AI is cognitive automation, not cognitive autonomy
Like the rest of computer science, AI is about making computers do more, not replacing humans.
November 28, 2022Read original →
The machine that makes the thing is more valuable than the thing
People, culture, and processes over artifacts.
November 21, 2022Read original →
Education as civilization-building
Today, there is too much focus on education as job training. Here's a better framework: education as a process of civilization-building — as a tool to maximize human potential.
November 13, 2022Read original →
Coming soon
Soon to be a weekly newsletter.
November 2, 2022Read original →