官术网_书友最值得收藏!

Programming an agent using an OpenAI Gym environment

The environment considered for this section is the Frozen Lake v0. The actual documentation of the concerned environment can be found at https://gym.openai.com/envs/FrozenLake-v0/.

This environment consists of 4 x 4 grids representing a lake. Thus, we have 16 grid blocks, where each block can be a start block(S), frozen block(F), goal block(G), or a hole block(H). Thus, the objective of the agent is to learn to navigate from start to goal without falling in the hole:

import Gym
env = Gym.make('FrozenLake-v0') #loads the environment FrozenLake-v0
env.render() # will output the environment and position of the agent

-------------------
S
FFF FHFH FFFH HFFG

At any given state, an agent has four actions to perform, which are up, down, left, and right. The reward at each step is 0 except the one leading to the goal state, then the reward would be 1. We start from the S state and our goal is to reach the G state without landing up in the H state in the most optimized path through the F states.

主站蜘蛛池模板: 松江区| 辽源市| 乌海市| 新丰县| 屏边| 衢州市| 神农架林区| 凉城县| 张掖市| 大田县| 公主岭市| 广德县| 溧阳市| 武山县| 平乐县| 峡江县| 隆德县| 嘉善县| 凯里市| 霸州市| 兰坪| 阳城县| 浦县| 石首市| 娄底市| 连平县| 广宁县| 云安县| 永安市| 营口市| 花垣县| 鄂伦春自治旗| 蒙山县| 桦南县| 大名县| 平武县| 高陵县| 通辽市| 安国市| 益阳市| 新平|