
Summary

In this chapter, we became familiar with our first RL method, the cross-entropy method, which is simple yet quite powerful despite its limitations. We applied it to the CartPole environment (with huge success) and to FrozenLake (with much more modest success). This chapter concludes the introductory part of the book.

In the upcoming chapters, we will explore more complex, but also more powerful, tools of deep RL.
