Skip to content
All resources

Go to the frontier

Playing Atari with Deep Reinforcement Learning

DeepMind2 min readFree

A plain summary, so you can get the gist here without leaving.

In 2013, DeepMind built a single program that learned to play many different Atari video games well, looking only at the raw screen pixels and the score, with no game-specific instructions.

What it is

This work combined two ideas. One is a deep neural network that can read the pixels on a screen. The other is reinforcement learning, a way of learning by trial and error where the program tries actions and gets rewarded or penalized by the outcome.

Put together, the system watched the game screen, chose moves like a joystick, and learned over many attempts which moves led to higher scores. The same approach was applied to a range of Atari titles.

The core idea

Reinforcement learning is learning from consequences. There is no teacher showing the right move. The program acts, sees the result, and gradually shifts toward choices that earn more reward over time, much like learning a game by playing it.

What made this notable is that the program received only what a human player sees, the pixels and the score, and figured out useful strategies on its own. One general method, not one hand-built bot per game, handled many games.

Why it matters

Showing that a single learning method could master many games from raw input suggested a path toward more general agents that learn skills rather than having them programmed in.

Reinforcement learning is now part of how modern AI is shaped, including the way large language models are tuned to be more helpful. For builders, this paper is an accessible early look at agents that learn by doing, an idea that keeps growing in importance.

Key points
  • Published in 2013 by DeepMind.
  • Combines a deep network for reading pixels with reinforcement learning.
  • Learns from only the screen and the score, with no game-specific rules.
  • One general method learned to play many different Atari games.
  • An early milestone for agents that learn skills by trial and error.
Open the original source

DeepMind

New to this? Come build with us.

Reading is good. Building with people is better. Our drop-ins are free and open to total beginners.