Mastering the game of Go with deep neural networks and tree search
David Silver, Aja Huang, Chris J. Maddison, Arthur Guez, Laurent Sifre, George van den Driessche, Julian Schrittwieser, Ioannis Antonoglou, Veda Panneershelvam, Marc Lanctot, Sander Dieleman, Dominik Grewe, John Nham, Nal Kalchbrenner, Ilya Sutskever, Timothy Lillicrap, Madeleine Leach, Koray Kavukcuoglu, Thore Graepel, & Demis Hassabis (2016)
Abstract. AlphaGo, which combines deep neural networks with Monte Carlo tree search, became the first program to defeat a professional human Go player. It is a milestone in reinforcement learning and a landmark in the modern history of AI.
Tags: reinforcement-learning, games, alphago