官术网_书友最值得收藏!

Reinforcement learning

Reinforcement learning is special in the sense that it doesn't require a dataset (see the following diagram). Instead, it involves an agent who takes actions, changing the state of the environment. After each step, it gets a reward or punishment, depending on the state and previous actions. The goal is to obtain a maximum cumulative reward. It can be used to teach the computer to play video games or drive a car. If you think about it, reinforcement learning is the way our pets train us humans: by rewarding our actions with tail-wagging, or punishing with scratched furniture.

One of the central topics in reinforcement learning is the exploration-exploitation dilemma—how to find a good balance between exploring new options and using what is already known:

Figure 1.3: Reinforcement learning process

Table 1.3: ML tasks:

主站蜘蛛池模板: 都昌县| 皋兰县| 黄龙县| 乌恰县| 得荣县| 湘乡市| 舒兰市| 马关县| 尉氏县| 保定市| 新津县| 武清区| 宁乡县| 武清区| 龙海市| 澜沧| 茂名市| 突泉县| 渝中区| 黔南| 清丰县| 镇原县| 黔东| 应用必备| 鲜城| 宝应县| 林州市| 江门市| 会昌县| 宜春市| 榆树市| 盖州市| 右玉县| 清水县| 渭源县| 安溪县| 景宁| 吴堡县| 伊川县| 卢氏县| 大名县|