Dynamic Programming in reinforcement-learning applications