
Summary
In this chapter, we became familiar with our first RL method, the cross-entropy method, which is simple but quite powerful despite its limitations. We applied it to the CartPole environment (with huge success) and to FrozenLake (with much more modest success). This chapter ends the introductory part of the book.
In the upcoming chapters, we will explore more complex but more powerful tools of deep RL.