官术网_书友最值得收藏!

  • Deep Learning By Example
  • Ahmed Menshawy
  • 94字
  • 2021-06-24 18:52:39

Reinforcement learning

The last type of learning in machine learning is reinforcement learning, in which there's no supervisor but only a reward signal.

So the reinforcement learning algorithm will try to make a decision and then a reward signal will be there to tell whether this decision is right or wrong. Also, this supervision feedback or reward signal may not come instantaneously but get delayed for a few steps. For example, the algorithm will take a decision now, but only after many steps will the reward signal tell whether decision was good or bad.

主站蜘蛛池模板: 开江县| 彭州市| 宁安市| 安溪县| 眉山市| 上犹县| 同仁县| 玉屏| 虹口区| 高要市| 通江县| 武强县| 乐东| 梨树县| 兴安盟| 怀化市| 咸阳市| 峨边| 安溪县| 岑巩县| 民县| 昂仁县| 聊城市| 庆元县| 温宿县| 石楼县| 嘉荫县| 两当县| 宁陵县| 将乐县| 福海县| 安溪县| 扶沟县| 沙坪坝区| 庐江县| 商南县| 册亨县| 桂阳县| 山阳县| 吉安县| 和平区|