live·247+ tools indexed·updated daily·review methodology
Perspectives/François Chollet
François Chollet

François Chollet

AI Researcher · Creator of Keras

Google · ARC Prize Foundation

Skeptic / CriticAcademic Researcher

Creator of Keras (the most widely-used deep learning API) and the ARC-AGI benchmark, which tests abstract reasoning abilities. Argues that current LLMs lack systematic generalization and that true intelligence requires efficient program synthesis.

#keras#arc-agi#reasoning#generalization#agi-benchmark

Core Positions & Ideas

1

Keras — Deep Learning Should Be Accessible to Everyone

2015

Created Keras as a high-level, user-friendly API for building neural networks, explicitly designed to be accessible to non-specialists. His philosophy: the most important factor in AI progress is the number of people who can run experiments. Keras accelerated deep learning adoption by making it approachable.

2

The ARC-AGI Benchmark — LLMs Cannot Generalize Like Humans

2019

Created the Abstraction and Reasoning Corpus (ARC) — a set of visual pattern completion tasks trivially easy for humans but consistently difficult for state-of-the-art AI systems. His thesis: ARC exposes the gap between 'crystallized' skill (LLMs excel here) and 'fluid intelligence' (generalizing to novel situations with few examples).

3

Intelligence Is Efficient Program Synthesis — Not Pattern Matching

2019

In 'On the Measure of Intelligence' (2019), Chollet proposed defining intelligence as the ability to acquire new skills efficiently given limited prior knowledge and data. By this measure, LLMs score poorly: they require massive data to learn each skill and fail to generalize across slight variations. True intelligence requires learning programs, not patterns.

4

Skill Is Not Intelligence — Current AI Systems Are Giant Skill Libraries

2023

Argues that LLMs are essentially compressed databases of human skills, retrieved and remixed on demand. This is useful but not intelligence. Intelligence is what happens when you face a situation your training distribution doesn't cover. LLMs fail systematically in those situations — which is exactly what ARC-AGI is designed to test.

5

Test-Time Compute May Be the Key — ARC Progress Shows This

2024

After OpenAI's o3 model showed dramatic ARC-AGI improvements in late 2024, Chollet updated his view: test-time compute (letting models 'think longer' via chain-of-thought and search) may bridge the gap between skill retrieval and reasoning. He moved ARC-AGI 2 goalposts: o3 solved v1, but new harder tasks remain.

Essential Reading & Watching

Recent Writing (6)

Related Voices