
Value function

A value function specifies how good it is for an agent to be in a particular state. It depends on the policy and is often denoted by v(s). It equals the total expected reward the agent receives when it starts in state s and follows the policy from there onward. Different policies yield different value functions; the optimal value function is the one whose value in every state is at least as high as that of any other value function. Similarly, an optimal policy is one whose value function is the optimal value function.
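To make the definition concrete, the following is a minimal sketch of how v(s) could be computed for one fixed policy by repeatedly applying the expected-reward update until the values stop changing. Everything specific in it is an assumption made for illustration and does not come from the text: the three-state chain (`A`, `B`, `end`), the rewards, the discount factor `gamma`, and the helper name `evaluate_policy`.

```python
def evaluate_policy(transitions, gamma=0.9, tol=1e-10):
    """Estimate v(s) for a fixed policy by iterative policy evaluation.

    transitions[s] is a list of (probability, next_state, reward) tuples
    describing what happens when the policy's action is taken in state s.
    A terminal state has an empty list, so its value stays 0.
    """
    v = {s: 0.0 for s in transitions}
    while True:
        delta = 0.0
        for s, outcomes in transitions.items():
            # Expected immediate reward plus discounted value of the next state.
            new_v = sum(p * (r + gamma * v[s2]) for p, s2, r in outcomes)
            delta = max(delta, abs(new_v - v[s]))
            v[s] = new_v
        if delta < tol:
            return v


# Hypothetical toy problem: A -> B (reward 0), B -> end (reward 1).
transitions = {
    "A": [(1.0, "B", 0.0)],
    "B": [(1.0, "end", 1.0)],
    "end": [],
}

values = evaluate_policy(transitions)
print(values)
```

In this toy chain, state B is worth 1 because the reward arrives on the next step, while state A is worth 0.9 because the same reward is one step further away and is discounted by `gamma`. Running the same evaluation for a different policy would generally give a different value function, which is why v(s) depends on the policy.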
