官术网_书友最值得收藏!

Reinforcement learning

Reinforcement learning aims to create algorithms that can learn and adapt to environmental changes. This programming technique is based on the concept of receiving external stimuli, the nature of which depends on the algorithm choices. A correct choice will involve a reward, while an incorrect choice will lead to a penalty. The goal of the system is to achieve the best possible result, of course.

In supervised learning, there is a teacher that tells the system the correct output (learning with a teacher). This is not always possible. Often, we have only qualitative information (sometimes binary, right/wrong, or success/failure).

The information available is called reinforcement signals. But the system does not give any information on how to update the agent's behavior (that is, weights). You cannot define a cost function or a gradient. The goal of the system is to create smart agents that have machinery able to learn from their experience.

主站蜘蛛池模板: 桑植县| 南靖县| 朝阳区| 宜兰市| 永德县| 新邵县| 嘉兴市| 湘西| 句容市| 镇安县| 剑河县| 岳池县| 德阳市| 台江县| 涞水县| 黄骅市| 五河县| 乐陵市| 巧家县| 呼玛县| 从江县| 巍山| 巴南区| 黎川县| 松阳县| 泊头市| 南昌县| 东光县| 哈巴河县| 弋阳县| 海林市| 岗巴县| 宝丰县| 伊宁市| 祁东县| 新宁县| 胶州市| 治县。| 宣汉县| 芜湖市| 民丰县|