官术网_书友最值得收藏!

  • The Reinforcement Learning Workshop
  • Alessandro Palmas Emanuele Ghelfi Dr. Alexandra Galina Petre Mayur Kulkarni Anand N.S. Quan Nguyen Aritra Sen Anthony So Saikat Basak
  • 186字
  • 2021-06-11 18:37:49

Summary

This chapter introduced us to the key technologies and concepts we can use to get started with reinforcement learning. The first two sections described two OpenAI Tools, OpenAI Gym and OpenAI Universe. These are collections that contain a large number of control problems that cover a broad spectrum of contexts, from classic tasks to video games, from browser usage to algorithm deduction. We learned how the interfaces of these environments are formalized, how to interact with them, and how to create a custom environment for a specific problem. Then, we learned how to build a policy network with TensorFlow, how to feed it with environment states to retrieve corresponding actions, and how to save the policy network weights. We also studied another OpenAI resource, Baselines. We solved problems that demonstrated how to train a reinforcement learning agent to solve a classic control task. Finally, using all the elements introduced in this chapter, we built an agent and trained it to play a classic Atari video game, thus achieving better-than-human performance.

In the next chapter, we will be delving deep into dynamic programming for reinforcement learning.

主站蜘蛛池模板: 临朐县| 敦化市| 新晃| 靖安县| 公主岭市| 武川县| 宽甸| 九龙坡区| 夹江县| 鹰潭市| 土默特右旗| 灵丘县| 龙陵县| 资源县| 界首市| 维西| 延安市| 台前县| 桃园市| 商丘市| 阿克陶县| 始兴县| 满城县| 扬中市| 陆川县| 进贤县| 太仆寺旗| 新兴县| 青海省| 永年县| 慈溪市| 永春县| 双桥区| 古蔺县| 咸宁市| 四会市| 富平县| 广南县| 南木林县| 兴安盟| 专栏|