官术网_书友最值得收藏!

How it works...

The random search algorithm works so well mainly because of the simplicity of our CartPole environment. Its observation state is composed of only four variables. You will recall that the observation in the Atari Space Invaders game is more than 100,000 (which is 210 * 160 * 3)  . The number of dimensions of the action state in CartPole is a third of that in Space Invaders. In general, simple algorithms work well for simple problems. In our case, we simply search for the best linear mapping from the observation to the action from a random pool.

Another interesting thing we've noticed is that before we select and deploy the best policy (the best linear mapping), random search also outperforms random action. This is because random linear mapping does take the observations into consideration. With more information from the environment, the decisions made in the random search policy are more intelligent than completely random ones.

主站蜘蛛池模板: 通州区| 汤原县| 余庆县| 祥云县| 凤山县| 西丰县| 方城县| 衡水市| 广丰县| 新宾| 凯里市| 芮城县| 资溪县| 柳河县| 九江市| 高安市| 金寨县| 长葛市| 自治县| 康保县| 通河县| 南昌县| 宁阳县| 河源市| 沙河市| 江华| 江口县| 当阳市| 微博| 西充县| 绿春县| 邵东县| 武陟县| 阿克陶县| 隆化县| 广汉市| 莱芜市| 商南县| 上栗县| 洱源县| 绥中县|