Cliff walking and grid world problems

Let's consider the cliff walking and grid world problems. We will first introduce both problems and then move on to the code. Both use a rectangular grid with nrows rows and ncols columns. The agent starts one cell to the south of the bottom-left cell, and the goal is to reach the destination, which is one cell to the south of the bottom-right cell.

Note that the start and destination cells are not part of the nrows x ncols grid of cells. In the cliff walking problem, the cells to the south of the bottom row, except for the start and destination cells, form a cliff: if the agent enters any of them, the episode ends with a catastrophic fall. If the agent instead tries to leave the grid through its left, top, or right boundary, it is placed back in the same cell, which is equivalent to taking no action.
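The movement rules for cliff walking can be sketched as a small transition function. This is only an illustration under stated assumptions, not the code developed later in the text: the name cliff_step, the (row, col) coordinates with row index nrows standing for the strip south of the grid, and the outcome labels are all hypothetical. Treating a move further south or sideways out of the start cell as a no-op is also an assumption, since the description does not cover that case.

```python
# Hypothetical layout: rows 0..nrows-1 form the grid; row index nrows is the
# strip to the south that holds the start, the cliff, and the destination.
ACTIONS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def cliff_step(state, action, nrows, ncols):
    """Return (next_state, outcome); outcome is 'move', 'cliff', or 'goal'."""
    start, goal = (nrows, 0), (nrows, ncols - 1)
    r, c = state
    dr, dc = ACTIONS[action]
    nr, nc = r + dr, c + dc

    # Leaving through the left, top, or right boundary is a no-op.
    # Assumption: moving below the southern strip is a no-op as well.
    if nr < 0 or nr > nrows or nc < 0 or nc >= ncols:
        return state, "move"
    if (nr, nc) == goal:
        return goal, "goal"
    # Any other cell in the southern strip, except the start, is cliff.
    if nr == nrows and (nr, nc) != start:
        return (nr, nc), "cliff"
    return (nr, nc), "move"
```

For example, with nrows=3 and ncols=4, cliff_step((2, 1), "down", 3, 4) reports a fall into the cliff, while cliff_step((2, 3), "down", 3, 4) reaches the destination.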

In the grid world problem, there is no cliff, but there are obstacles inside the grid. If the agent tries to enter an obstacle cell, it is bounced back to the cell it came from. In both problems, the goal is to find the optimal path from the start to the destination.
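The bounce-back rule for obstacles can be sketched in the same way. Again, this is a minimal illustration rather than the implementation used later: the name grid_step, the obstacles set of (row, col) tuples, and the coordinate layout (row index nrows for the strip south of the grid) are hypothetical. Since the description does not say what lies south of the bottom row in the grid world apart from the start and destination cells, the sketch assumes those cells behave like a wall.

```python
# Hypothetical layout: rows 0..nrows-1 form the grid; the start and the
# destination sit in row index nrows, just south of the bottom row.
ACTIONS = {"up": (-1, 0), "down": (1, 0), "left": (0, -1), "right": (0, 1)}


def grid_step(state, action, nrows, ncols, obstacles):
    """Return the next state for a grid world with obstacle cells."""
    start, goal = (nrows, 0), (nrows, ncols - 1)
    r, c = state
    dr, dc = ACTIONS[action]
    nxt = (r + dr, c + dc)

    inside = 0 <= nxt[0] < nrows and 0 <= nxt[1] < ncols
    reachable = inside or nxt in (start, goal)
    # Obstacles bounce the agent back; so does any cell outside the layout
    # (assumption: the strip south of the grid, other than start and goal,
    # acts as a wall).
    if nxt in obstacles or not reachable:
        return state
    return nxt
```

With obstacles = {(1, 1)} on a 3 x 4 grid, grid_step((1, 0), "right", 3, 4, obstacles) leaves the agent at (1, 0), because the target cell is an obstacle.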

So, let's dive in!