官术网_书友最值得收藏!

Reinforcement learning

The last type of learning in machine learning is reinforcement learning, in which there's no supervisor but only a reward signal.

So the reinforcement learning algorithm will try to make a decision and then a reward signal will be there to tell whether this decision is right or wrong. Also, this supervision feedback or reward signal may not come instantaneously but get delayed for a few steps. For example, the algorithm will take a decision now, but only after many steps will the reward signal tell whether decision was good or bad.

主站蜘蛛池模板: 穆棱市| 紫云| 根河市| 诏安县| 青冈县| 麟游县| 花莲市| 安图县| 建阳市| 上饶县| 福建省| 利川市| 高密市| 图木舒克市| 锦屏县| 康乐县| 若羌县| 屏边| 太白县| 灵石县| 汤阴县| 宝兴县| 库尔勒市| 桦南县| 榕江县| 开平市| 大英县| 田阳县| 南康市| 远安县| 房山区| 临沂市| 镇康县| 石屏县| 通海县| 博白县| 铜山县| 永平县| 鄢陵县| 高碑店市| 六枝特区|