官术网_书友最值得收藏!

Summary

My congratulations, you've made another step towards understanding modern, state-of-the-art RL methods! We learned about some very important concepts that are widely used in deep RL: the value of state, the value of actions, and the Bellman equation in various forms. We saw the value iteration method, which is a very important building block in the area of Q-learning. Finally, we got to know how value iteration can improve our FrozenLake solution.

In the next chapter, we'll learn about deep Q-networks, which started the deep RL revolution in 2013, by beating humans on lots of Atari 2600 games.

主站蜘蛛池模板: 泸水县| 尉氏县| 库伦旗| 鄂托克前旗| 鹤岗市| 盘山县| 深水埗区| 贵港市| 绩溪县| 彰化市| 安顺市| 邢台市| 桐柏县| 共和县| 泰顺县| 繁昌县| 邵东县| 桃园市| 留坝县| 祁阳县| 新田县| 泰顺县| 建昌县| 青阳县| 莒南县| 望江县| 富锦市| 深泽县| 酒泉市| 梁平县| 富顺县| 望江县| 香格里拉县| 舞阳县| 微山县| 沁水县| 比如县| 昌邑市| 清苑县| 三门峡市| 昆山市|