Value function
A value function measures how good it is for an agent to be in a particular state. It depends on the policy and is often denoted by v(s). It equals the expected total reward the agent receives starting from state s and following the policy thereafter. Different policies yield different value functions; the optimal value function is the one whose value in every state is at least as high as that of any other value function. Similarly, an optimal policy is one whose value function is the optimal value function.
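To make the dependence on the policy concrete, here is a minimal sketch that computes v(s) for two policies by iterative policy evaluation. The three-state chain, the action names "right" and "stay", the rewards, and the discount factor gamma = 0.9 are all assumptions made for illustration; none of them come from the text above.

```python
def evaluate_policy(transitions, policy, gamma=0.9, tol=1e-10):
    """Estimate v(s) for a fixed policy by repeated Bellman backups.

    transitions[state][action] is a list of (probability, next_state, reward).
    policy[state] is the action chosen in that state; states missing from
    the policy are treated as terminal and keep a value of 0.
    """
    values = {state: 0.0 for state in transitions}
    while True:
        delta = 0.0
        for state in transitions:
            if state not in policy:  # terminal state
                continue
            outcomes = transitions[state][policy[state]]
            new_value = sum(
                prob * (reward + gamma * values[next_state])
                for prob, next_state, reward in outcomes
            )
            delta = max(delta, abs(new_value - values[state]))
            values[state] = new_value
        if delta < tol:
            return values


# Hypothetical chain: 0 -> 1 -> 2, where state 2 is terminal and
# entering it yields a reward of 1.
transitions = {
    0: {"right": [(1.0, 1, 0.0)], "stay": [(1.0, 0, 0.0)]},
    1: {"right": [(1.0, 2, 1.0)], "stay": [(1.0, 1, 0.0)]},
    2: {},
}

policy_right = {0: "right", 1: "right"}
policy_stay = {0: "stay", 1: "stay"}

v_right = evaluate_policy(transitions, policy_right)
v_stay = evaluate_policy(transitions, policy_stay)

print("always right:", v_right)
print("always stay: ", v_stay)
```

In this toy problem the value function of the "right" policy is at least as high as that of the "stay" policy in every state, which is the sense in which one value function is better than another. Here "total reward" is taken to be the discounted sum of rewards, which is a common convention but an assumption on top of the text.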