Chapter 3: The Markov Decision Process and Dynamic Programming