A plain summary, so you can get the gist here without leaving.
In 2012, three researchers trained a deep neural network on graphics cards to recognize objects in photos, and it won a major contest by such a margin that it changed how the whole field thought about machine learning.
What it is
AlexNet, named after Alex Krizhevsky and built with Ilya Sutskever and Geoffrey Hinton, is a convolutional neural network. That is a kind of model that learns to spot patterns in images by scanning small patches and combining them into larger and larger features, from edges to shapes to whole objects.
The contest was ImageNet, a challenge to correctly label photos across a thousand categories. AlexNet made far fewer mistakes than the methods that came before it. The gap was wide enough that people paid attention immediately.
The core idea
Two things came together. First, the network was deep, with many layers stacked so it could learn rich features on its own instead of relying on hand-designed rules. Second, the team trained it on GPUs, the chips originally made for video games, which could do the heavy math fast enough to make a big network practical.
There was also plenty of labeled data to learn from. Deep model, fast hardware, and lots of examples turned out to be a powerful recipe, and that combination is still the backbone of modern AI.
Why it matters
AlexNet is widely seen as the spark of the deep learning era. After it, researchers and companies poured effort into neural networks, and progress in vision, speech, and eventually language followed.
For anyone building with AI today, this paper is the origin story of the tools you use. The lesson it teaches, that scale of data and compute can unlock learning that hand-tuned methods cannot, has guided almost everything since.
- Published in 2012 by Krizhevsky, Sutskever, and Hinton.
- A deep convolutional network that learns image features layer by layer.
- Trained on GPUs, which made a large network practical to run.
- Won the ImageNet contest by a wide margin over older methods.
- Widely credited with starting the modern deep learning era.
Krizhevsky, Sutskever & Hinton
New to this? Come build with us.
Reading is good. Building with people is better. Our drop-ins are free and open to total beginners.