Abstract. The definitive textbook on reinforcement learning. Develops the field from multi-armed bandits through Markov decision processes, temporal-difference learning, policy gradients, and function approximation.
Tags: textbook, reinforcement-learning, url-only