官术网_书友最值得收藏!

Reinforcement learning

Reinforcement learning aims to create algorithms that can learn and adapt to environmental changes. This programming technique is based on the concept of receiving external stimuli, the nature of which depends on the algorithm choices. A correct choice will involve a reward, while an incorrect choice will lead to a penalty. The goal of the system is to achieve the best possible result, of course.

In supervised learning, there is a teacher that tells the system the correct output (learning with a teacher). This is not always possible. Often, we have only qualitative information (sometimes binary, right/wrong, or success/failure).

The information available is called reinforcement signals. But the system does not give any information on how to update the agent's behavior (that is, weights). You cannot define a cost function or a gradient. The goal of the system is to create smart agents that have machinery able to learn from their experience.

主站蜘蛛池模板: 麦盖提县| 海安县| 临颍县| 西平县| 林州市| 肃北| 惠州市| 宁安市| 托克托县| 贵德县| 宁都县| 宁武县| 太仆寺旗| 尖扎县| 胶南市| 宁明县| 昭通市| 云南省| 旺苍县| 嘉峪关市| 平遥县| 乌拉特后旗| 宁陵县| 西畴县| 偏关县| 怀化市| 三河市| 桂林市| 安国市| 新邵县| 体育| 西丰县| 凌云县| 巴中市| 洛南县| 神池县| 三原县| 裕民县| 中江县| 阳信县| 莱西市|