David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel, & Demis Hassabis (2016), References, Textbook of AI

David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel, & Demis Hassabis (2016)

Nature, 529(7587), 484-489.

DOI: https://doi.org/10.1038/nature16961

Abstract. AlphaGo, combining deep neural networks with Monte Carlo tree search, became the first program to defeat a professional human Go player. A milestone in reinforcement learning and a landmark in the modern history of AI.

Tags: reinforcement-learning games alphago

AI tools used: Claude (research, coding, text), ChatGPT (diagrams, images), Grammarly (editing).

Mastering the game of Go with deep neural networks and tree search