- Keras Reinforcement Learning Projects
- Giuseppe Ciaburro
- 154字
- 2021-08-13 15:26:02
Reinforcement learning
Reinforcement learning aims to create algorithms that can learn and adapt to environmental changes. This programming technique is based on the concept of receiving external stimuli, the nature of which depends on the algorithm choices. A correct choice will involve a reward, while an incorrect choice will lead to a penalty. The goal of the system is to achieve the best possible result, of course.
In supervised learning, there is a teacher that tells the system the correct output (learning with a teacher). This is not always possible. Often, we have only qualitative information (sometimes binary, right/wrong, or success/failure).
The information available is called reinforcement signals. But the system does not give any information on how to update the agent's behavior (that is, weights). You cannot define a cost function or a gradient. The goal of the system is to create smart agents that have machinery able to learn from their experience.
- 21天學(xué)通JavaScript
- Drupal 7 Multilingual Sites
- 嵌入式Linux上的C語(yǔ)言編程實(shí)踐
- 基于多目標(biāo)決策的數(shù)據(jù)挖掘方法評(píng)估與應(yīng)用
- 統(tǒng)計(jì)學(xué)習(xí)理論與方法:R語(yǔ)言版
- Excel 2007常見(jiàn)技法與行業(yè)應(yīng)用實(shí)例精講
- TensorFlow Reinforcement Learning Quick Start Guide
- 液壓機(jī)智能故障診斷方法集成技術(shù)
- Salesforce Advanced Administrator Certification Guide
- ZigBee無(wú)線通信技術(shù)應(yīng)用開(kāi)發(fā)
- 貫通開(kāi)源Web圖形與報(bào)表技術(shù)全集
- Deep Learning Essentials
- 自適應(yīng)學(xué)習(xí):人工智能時(shí)代的教育革命
- 從機(jī)器學(xué)習(xí)到無(wú)人駕駛
- Hands-On Agile Software Development with JIRA