References

Mastering the game of Go with deep neural networks and tree search

David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel, & Demis Hassabis (2016)

Nature, 529(7587), 484-489.

DOI: https://doi.org/10.1038/nature16961

Abstract. AlphaGo, combining deep neural networks with Monte Carlo tree search, became the first program to defeat a professional human Go player. A milestone in reinforcement learning and a landmark in the modern history of AI.

Tags: reinforcement-learning games alphago

This site is currently in Beta. Contact: Chris Paton

Textbook of Usability · Textbook of Digital Health

Auckland Maths and Science Tutoring

AI tools used: Claude (research, coding, text), ChatGPT (diagrams, images), Grammarly (editing).