- Keras Reinforcement Learning Projects
- Giuseppe Ciaburro
- 2021-08-13 15:26:04
Reinforcement learning algorithms
As we have seen in the previous sections, reinforcement learning is a programming technique that aims to develop algorithms that can learn and adapt to changes in the environment. It is based on the assumption that the agent can receive stimuli from the outside and change its actions in response to them. A correct choice results in a reward, while an incorrect choice results in a penalty.
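The reward-and-penalty loop described above can be sketched in a few lines of Python. This is only an illustrative toy, not a method taken from the book: the function name `run_episode`, the two-action setup, the +1/-1 reward values, and the parameters `epsilon` and `step_size` are all assumptions made for the example.

```python
import random


def run_episode(n_steps=500, correct_action=1, epsilon=0.1, step_size=0.1, seed=0):
    """Toy agent that learns which of two actions the environment rewards."""
    rng = random.Random(seed)
    values = [0.0, 0.0]  # the agent's running estimate of each action's value
    total_reward = 0.0
    for _ in range(n_steps):
        # Occasionally try a random action; otherwise pick the best-looking one.
        if rng.random() < epsilon:
            action = rng.randrange(2)
        else:
            action = max(range(2), key=lambda a: values[a])
        # The environment's stimulus: reward a correct choice, penalize a wrong one.
        reward = 1.0 if action == correct_action else -1.0
        # Move the estimate for the chosen action toward the observed reward.
        values[action] += step_size * (reward - values[action])
        total_reward += reward
    return values, total_reward


if __name__ == "__main__":
    values, total_reward = run_episode()
    print("estimated values:", values)
    print("total reward:", total_reward)
```

After enough steps, the estimate for the rewarded action should end up higher than the estimate for the penalized one, so the agent chooses the rewarded action most of the time.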
The goal of the system is to achieve the highest possible reward and consequently the best possible result. This result can be obtained through two approaches:
- The first approach involves evaluating the choices of the algorithm and then rewarding or punishing it based on the result. Techniques of this kind can adapt even to substantial changes in the environment. An example is an image recognition program that improves its performance with use. In this case, learning takes place continuously.
- In the second approach, the algorithm is first trained in a preliminary phase; once the system is considered reliable, it is frozen and can no longer be modified. This approach stems from the observation that constantly evaluating the actions of the algorithm can be very expensive or impossible to automate.
These are only implementation choices, so a single algorithm may combine both of the approaches just described.
So far, we have introduced the basic concepts of reinforcement learning. Now, we can analyze the various ways in which these concepts have been turned into algorithms. In this section, we will list them and provide an overview; we will explore them in more depth through the practical cases addressed in the following chapters.
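To make the difference between the two approaches concrete, here is a minimal sketch, assuming a toy environment in which the rewarded action changes halfway through. The class `Agent`, the helper functions `interact` and `compare`, the `frozen` flag, and all parameter values are hypothetical names and choices introduced for this illustration only.

```python
import random


class Agent:
    """Toy two-action learner; set `frozen` to True to stop all further learning."""

    def __init__(self, n_actions=2, step_size=0.2, epsilon=0.1, seed=0):
        self.values = [0.0] * n_actions
        self.step_size = step_size
        self.epsilon = epsilon
        self.frozen = False
        self.rng = random.Random(seed)

    def act(self):
        # A frozen agent never explores; it only exploits what it learned.
        if not self.frozen and self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.values))
        return max(range(len(self.values)), key=lambda a: self.values[a])

    def learn(self, action, reward):
        if self.frozen:
            return  # second approach: the trained system is no longer modified
        self.values[action] += self.step_size * (reward - self.values[action])


def interact(agent, correct_action, n_steps):
    """Run the agent for n_steps and return the total reward it collected."""
    total = 0.0
    for _ in range(n_steps):
        action = agent.act()
        reward = 1.0 if action == correct_action else -1.0
        agent.learn(action, reward)
        total += reward
    return total


def compare(n_steps=300):
    online = Agent(seed=1)  # first approach: keeps learning continuously
    frozen = Agent(seed=1)  # second approach: trained once, then frozen
    interact(online, 0, n_steps)
    interact(frozen, 0, n_steps)
    frozen.frozen = True
    # The environment changes: action 1 is now the rewarded one.
    return interact(online, 1, n_steps), interact(frozen, 1, n_steps)


if __name__ == "__main__":
    online_total, frozen_total = compare()
    print("continuous learner after the change:", online_total)
    print("frozen learner after the change:", frozen_total)
```

In this toy setting the continuously learning agent recovers after the change, while the frozen agent keeps repeating the choice that used to be correct. The frozen agent, on the other hand, needs no further evaluation of its actions, which is the cost the second approach is meant to avoid.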