書名： Hands-On Neural Networks
作者名： Leonardo De Marchi Laura Mitchell
本章字數： 92字
更新時間： 2021-06-24 14:00:09

Reinforcement learning

Reinforcement learning (RL) is the most distinct category, with respect to the one we saw so far. The concept is quite fascinating: the algorithm is trying to find a policy to maximize the sum of rewards.

The policy is learned by an agent who uses it to take actions in an environment. The environment then returns feedback, which the agent uses to improve its policy. The feedback is the reward for the action taken and it can be a positive, null, or negative number, as shown in the following diagram:

官术网_书友最值得收藏!

Hands-On Neural Networks

Reinforcement learning