官术网_书友最值得收藏!

Reinforcement learning

Remember how you learned to ride a bicycle in your childhood? It was a trial and error process, right? You tried to balance yourself, and each time you did something wrong, you tipped off the bicycle. But, you learned from your mistakes, and eventually, you were able to ride without falling. In the same way, Reinforcement learning does the same! An agent is exposed to an environment where it takes action from a list of possible actions, which leads to a change in the state of the agent. A state is the current situation of the environment the agent is in. For every action, the agent receives an award. Whenever the received reward is positive, it signifies the agent has taken the correct step, and when the reward is negative, it signifies a mistake. The agent follows a policy, a reinforcement learning algorithm through which the agent determines next actions considering the current state. Reinforcement learning is the true form of artificial intelligence, inspired by a human's way of learning through trial and error. Think of yourself as the agent and the bicycle the environment! Discussing reinforcement learning algorithms here is beyond the scope of this book, so let's shift focus back to deep learning!

主站蜘蛛池模板: 怀仁县| 南漳县| 呼图壁县| 萨嘎县| 汉寿县| 民勤县| 广丰县| 衡南县| 浪卡子县| 仁化县| 花莲县| 永宁县| 苏尼特右旗| 资溪县| 保德县| 大渡口区| 沛县| 五台县| 屏边| 平泉县| 上林县| 开平市| 乌兰察布市| 绥芬河市| 嫩江县| 岳普湖县| 玉树县| 同江市| 克什克腾旗| 宁波市| 博罗县| 东乡族自治县| 肇州县| 焉耆| 玛沁县| 武宣县| 宁安市| 阿拉善右旗| 奉节县| 南溪县| 富锦市|