官术网_书友最值得收藏!

Overview of Keras Reinforcement Learning

Nowadays, most computers are based on a symbolic elaboration, that is, the problem is first encoded in a set of variables and then processed using an explicit algorithm that, for each possible input of the problem, offers an adequate output. However, there are problems in which resolution with an explicit algorithm is inefficient or even unnatural, for example with a speech recognizer; tackling this kind of problem with the classic approach is inefficient. This and other similar problems, such as autonomous navigation of a robot or voice assistance in performing an operation, are part of a very diverse set of problems that can be addressed directly through solutions based on reinforcement learning.

Reinforcement learning is a very exciting part of machine learning, used in applications ranging from autonomous cars to playing games. Reinforcement learning aims to create algorithms that can learn and adapt to environmental changes. To do this, we use external feedback signals (reward signals) generated by the environment according to the choices made by the algorithm. A correct choice will result in a reward, while an incorrect choice will lead to a penalization of the system. All of this is in order to achieve the best result obtainable.

The topics covered in this chapter are the following:

  • An overview of machine learning
  • Reinforcement learning
  • Markov Decision Process (MDP)
  • Temporal difference (TD) learning
  • Q-learning
  • Deep Q-learning networks

At the end of the chapter, you will be fully introduced to the power of reinforcement learning and will learn the different approaches to this technique. Several reinforcement learning methods will be covered.

主站蜘蛛池模板: 龙岩市| 新津县| 青阳县| 社会| 泉州市| 阿克苏市| 论坛| 湘潭县| 台江县| 涿州市| 磐石市| 湘潭市| 罗平县| 宁安市| 石渠县| 红桥区| 股票| 庐江县| 鹤峰县| 沙湾县| 尉氏县| 那坡县| 临高县| 即墨市| 洛阳市| 南充市| 怀化市| 永济市| 安岳县| 佛山市| 岗巴县| 西平县| 长汀县| 金坛市| 雷州市| 拉萨市| 通山县| 商城县| 安顺市| 新民市| 隆化县|