書名： Hands-On Reinforcement Learning with Python
作者名： Sudharsan Ravichandiran
本章字?jǐn)?shù)： 104字
更新時(shí)間： 2021-06-18 19:12:01

Policy function

A policy defines the agent's behavior in an environment. The way in which the agent decides which action to perform depends on the policy. Say you want to reach your office from home; there will be different routes to reach your office, and some routes are shortcuts, while some routes are long. These routes are called policies because they represent the way in which we choose to perform an action to reach our goal. A policy is often denoted by the symbol ??. A policy can be in the form of a lookup table or a complex search process.

官术网_书友最值得收藏!

Hands-On Reinforcement Learning with Python

Policy function