Python Reinforcement Learning
上QQ阅读APP看书,第一时间看更新

Solving the Bellman equation

We can find the optimal policies by solving the Bellman optimality equation. To solve the Bellman optimality equation, we use a special technique called dynamic programming.