官术网_书友最值得收藏!

  • The Reinforcement Learning Workshop
  • Alessandro Palmas Emanuele Ghelfi Dr. Alexandra Galina Petre Mayur Kulkarni Anand N.S. Quan Nguyen Aritra Sen Anthony So Saikat Basak
  • 189字
  • 2021-06-11 18:37:43

Summary

RL is one of the fundamental paradigms under the umbrella of machine learning. The principles of RL are very general and interdisciplinary, and they are not bound to a specific application.

RL considers the interaction of an agent with an external environment, taking inspiration from the human learning process. RL explicitly targets the need to explore efficiently and the exploration-exploitation trade-off appearing in almost all human problems; this is a peculiarity that distinguishes this discipline from others.

We started this chapter with a high-level description of RL, showing some interesting applications. We then introduced the main concepts of RL, describing what an agent is, what an environment is, and how an agent interacts with its environment. Finally, we implemented Gym and Baselines by showing how these libraries make RL extremely simple.

In the next chapter, we will learn more about the theory behind RL, starting with Markov chains and arriving at MDPs. We will present the two functions at the core of almost all RL algorithms, namely the state-value function, which evaluates the goodness of states, and the action-value function, which evaluates the quality of the state-action pair.

主站蜘蛛池模板: 长乐市| 老河口市| 噶尔县| 四子王旗| 竹溪县| 福州市| 宁乡县| 庐江县| 紫云| 蒙阴县| 武定县| 济南市| 静宁县| 肇东市| 定结县| 历史| 广丰县| 正安县| 枣强县| 佛冈县| 安西县| 雷波县| 英吉沙县| 高清| 天镇县| 灌南县| 卢湾区| 克拉玛依市| 鄂尔多斯市| 靖宇县| 大石桥市| 本溪市| 扎赉特旗| 唐山市| 麦盖提县| 平武县| 大名县| 合阳县| 资兴市| 北宁市| 灵山县|