官术网_书友最值得收藏!

Control processes

An MDP is itself a type of problem called a control process. Broadly speaking, a control process is designed to optimize a value or a set of values within a set of limitations. The following diagram shows a model for a generalized control process:

A real-world example of such an optimization process might be maximizing a company's profit by knowing its gross sales and variable expenses. We might want to keep expenses as low as possible without affecting the sales, and keep the sales as high as possible on a scale that we can manage without incurring too many new expenses. 

An MDP models a set of states, a set of actions that can be taken from each state, and the outcome of taking each action. As the agent making the decisions in an MDP, part of the outcome of each action that we take is under our control, and part of it is random and unknown to us. 

We will not go too deeply into the mathematical details of MDPs, but this information is worth digging into if you want a better understanding of how stochastic processes can be modeled in the context of Bayesian statistics, such as in Monte Carlo problems.

We will link to some useful resources on the mathematics of MDPs at the end of the chapter. For now, it will be sufficient to get an intuitive and visual understanding of them.

主站蜘蛛池模板: 合川市| 兴安县| 塘沽区| 英吉沙县| 塔城市| 衡东县| 康平县| 义乌市| 黄陵县| 通化市| 丰顺县| 台北县| 莱州市| 昌都县| 盐山县| 宁都县| 雅安市| 和田市| 东乌| 隆林| 墨脱县| 理塘县| 勐海县| 汉川市| 孝昌县| 盐源县| 兴宁市| 怀集县| 内乡县| 农安县| 保定市| 酉阳| 丹凤县| 长寿区| 绥滨县| 鄂尔多斯市| 邹城市| 望江县| 剑川县| 邳州市| 平原县|