Book title: Python Reinforcement Learning
Authors: Sudharsan Ravichandiran, Sean Saito, Rajalingappaa Shanmugamani, Yang Wenzhuo
Word count for this section: 91
Last updated: 2021-06-24 15:17:30

Summary

In this chapter, we set up our machine by installing Anaconda, Docker, OpenAI Gym, Universe, and TensorFlow. We also learned how to create simulations with OpenAI Gym and how to train agents in an OpenAI environment. Finally, we covered the fundamentals of TensorFlow and saw how to visualize computation graphs in TensorBoard.

In Chapter 3, The Markov Decision Process and Dynamic Programming, we will learn about the Markov decision process and dynamic programming, and how to solve the Frozen Lake problem using value iteration and policy iteration.
