  • Python Reinforcement Learning
  • Sudharsan Ravichandiran Sean Saito Rajalingappaa Shanmugamani Yang Wenzhuo
  • 2021-06-24 15:17:30

Summary

In this chapter, we learned how to set up our machine by installing Anaconda, Docker, OpenAI Gym, Universe, and TensorFlow. We also learned how to create simulations using OpenAI Gym and how to train agents to learn in an OpenAI environment. We then covered the fundamentals of TensorFlow and visualized computation graphs in TensorBoard.

In Chapter 3, The Markov Decision Process and Dynamic Programming, we will learn about the Markov decision process and dynamic programming, and how to solve the frozen lake problem using value iteration and policy iteration.
