官术网_书友最值得收藏!

Reinforcement learning

The last type of learning in machine learning is reinforcement learning, in which there's no supervisor but only a reward signal.

So the reinforcement learning algorithm will try to make a decision and then a reward signal will be there to tell whether this decision is right or wrong. Also, this supervision feedback or reward signal may not come instantaneously but get delayed for a few steps. For example, the algorithm will take a decision now, but only after many steps will the reward signal tell whether decision was good or bad.

主站蜘蛛池模板: 朝阳区| 高安市| 琼海市| 西昌市| 嘉义市| 桂阳县| 静海县| 武威市| 新密市| 蚌埠市| 阿拉尔市| 泾阳县| 龙陵县| 甘孜| 忻城县| 东乡| 新安县| 清苑县| 江永县| 思南县| 靖边县| 肇源县| 防城港市| 建宁县| 咸宁市| 彭水| 禹城市| 邳州市| 枞阳县| 长沙县| 万荣县| 昌乐县| 商南县| 马鞍山市| 焦作市| 老河口市| 宁城县| 双流县| 普洱| 宜章县| 新邵县|