Reinforcement learning

Reinforcement learning aims to create algorithms that can learn and adapt to changes in their environment. The technique is based on receiving external stimuli whose nature depends on the choices the algorithm makes: a correct choice earns a reward, while an incorrect choice incurs a penalty. The goal of the system is to achieve the best possible result, that is, to collect as much reward as it can.

In supervised learning, a teacher tells the system the correct output for each input (learning with a teacher). This is not always possible. Often, only qualitative information is available, sometimes as coarse as a binary right/wrong or success/failure signal.

This available information is called the reinforcement signal. The signal, however, gives no information on how to update the agent's behavior (that is, its weights), so a cost function or a gradient cannot be defined the way it is in supervised learning. The goal is to create smart agents equipped with machinery that lets them learn from their own experience.
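
To make the idea of learning from a reinforcement signal concrete, here is a minimal Python sketch. It is not taken from the text: the function name `train_agent`, its parameters, the +1/-1 reward values, and the epsilon-greedy rule for choosing actions are all assumptions picked for illustration. The agent is never told which action is correct; it only sees a reward or a penalty after each choice and keeps a running estimate of how good each action has been.

```python
import random


def train_agent(correct_action=1, n_actions=2, episodes=200,
                epsilon=0.1, step=0.1, seed=0):
    """Learn action values from reward/penalty feedback only (illustrative)."""
    rng = random.Random(seed)
    values = [0.0] * n_actions  # estimated value of each action

    for _ in range(episodes):
        # Explore occasionally; otherwise pick the best-looking action.
        if rng.random() < epsilon:
            action = rng.randrange(n_actions)
        else:
            action = max(range(n_actions), key=lambda a: values[a])

        # Hypothetical environment: reward a correct choice, penalize the rest.
        reward = 1.0 if action == correct_action else -1.0

        # Move the estimate for the chosen action toward the observed reward.
        values[action] += step * (reward - values[action])

    return values


if __name__ == "__main__":
    learned = train_agent()
    best = max(range(len(learned)), key=lambda a: learned[a])
    print("estimated values:", learned)
    print("preferred action:", best)
```

Note that the update uses only the scalar reward for the action actually taken. No correct output is supplied and no gradient of a cost function is computed, which is the contrast with supervised learning drawn above.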