官术网_书友最值得收藏!

Chapter 5. Tabular Learning and the Bellman Equation

In the previous chapter, we got acquainted with our first Reinforcement Learning (RL) method, cross-entropy, and saw its strengths and weaknesses. In this new part of the book, we'll look at another group of methods, called Q-learning, which have much more flexibility and power.

This chapter will establish the required background shared by those methods. We'll also revisit the FrozenLake environment and show how new concepts will fit with this environment and help us to address the issues of the environment's uncertainty.

主站蜘蛛池模板: 彭州市| 宝应县| 乌兰察布市| 织金县| 苍梧县| 梁山县| 滦南县| 京山县| 洛扎县| 隆德县| 乌鲁木齐县| 读书| 青龙| 仙居县| 砚山县| 龙川县| 淮阳县| 丹寨县| 刚察县| 普洱| 昌平区| 昌邑市| 永新县| 容城县| 淮滨县| 交口县| 崇州市| 大同县| 五原县| 大丰市| 宾阳县| 微博| 化德县| 西城区| 育儿| 太保市| 庆云县| 威远县| 西城区| 平和县| 锡林郭勒盟|