- PyTorch 1.x Reinforcement Learning Cookbook
- Yuxi (Hayden) Liu
- 2021-06-24 12:34:41
How it works...
Random search works this well mainly because the CartPole environment is so simple. Its observation consists of only four variables, whereas an observation in the Atari Space Invaders game has more than 100,000 values (210 * 160 * 3 = 100,800). The action space is smaller too: CartPole has a third as many actions as Space Invaders. In general, simple algorithms work well for simple problems. In our case, we simply search a random pool for the best linear mapping from the observation to the action.
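To make the size comparison and the idea of a linear mapping concrete, here is a minimal sketch. It uses plain Python lists rather than tensors so that it runs without dependencies. The action counts (2 for CartPole, 6 for Space Invaders) are assumed from the standard Gym versions of the two environments, and `random_weight`, `select_action`, and the sample observation are hypothetical names and values chosen for illustration.

```python
import random

# Observation sizes quoted in the text; the action counts are assumptions
# based on the standard Gym versions of the two environments.
cartpole_obs_dim = 4
space_invaders_obs_dim = 210 * 160 * 3
cartpole_n_actions = 2
space_invaders_n_actions = 6

print(space_invaders_obs_dim)                          # 100800
print(space_invaders_n_actions // cartpole_n_actions)  # 3


def random_weight(n_obs, n_actions, rng):
    """Draw one candidate linear mapping: an n_obs x n_actions matrix."""
    return [[rng.random() for _ in range(n_actions)] for _ in range(n_obs)]


def select_action(observation, weight):
    """Score each action as a weighted sum of the observation, then take the argmax."""
    n_actions = len(weight[0])
    scores = [
        sum(o * row[a] for o, row in zip(observation, weight))
        for a in range(n_actions)
    ]
    return max(range(n_actions), key=scores.__getitem__)


rng = random.Random(0)
weight = random_weight(cartpole_obs_dim, cartpole_n_actions, rng)
observation = [0.02, -0.01, 0.03, 0.04]  # hypothetical CartPole observation
action = select_action(observation, weight)
print(action)
```

A CartPole policy of this form has only 4 * 2 = 8 parameters, which is why sampling a modest pool of random matrices and keeping the best one can be enough.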
Another interesting observation is that, even before we select and deploy the best policy (the best linear mapping), random search already outperforms taking random actions. This is because a random linear mapping still takes the observation into account. With that information from the environment, its decisions are more intelligent than completely random ones.
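The comparison between random linear mappings and random actions can be sketched without any external library. The simulator below is a hand-written simplification that follows the standard cart-pole equations of motion; it is not the Gym environment. Its constants, the 200-step cap, and the names `step`, `run_episode`, `random_action_policy`, and `make_linear_policy` are all assumptions made for this illustration.

```python
import math
import random

# Assumed physical constants and limits (chosen to resemble the classic task).
GRAVITY, MASS_CART, MASS_POLE = 9.8, 1.0, 0.1
HALF_LENGTH, FORCE_MAG, TAU = 0.5, 10.0, 0.02
X_LIMIT, THETA_LIMIT, MAX_STEPS = 2.4, 12 * math.pi / 180, 200


def step(state, action):
    """Advance the cart-pole by one Euler step; action 1 pushes right, 0 pushes left."""
    x, x_dot, theta, theta_dot = state
    force = FORCE_MAG if action == 1 else -FORCE_MAG
    total_mass = MASS_CART + MASS_POLE
    pole_ml = MASS_POLE * HALF_LENGTH
    cos_t, sin_t = math.cos(theta), math.sin(theta)
    temp = (force + pole_ml * theta_dot ** 2 * sin_t) / total_mass
    theta_acc = (GRAVITY * sin_t - cos_t * temp) / (
        HALF_LENGTH * (4.0 / 3.0 - MASS_POLE * cos_t ** 2 / total_mass))
    x_acc = temp - pole_ml * theta_acc * cos_t / total_mass
    return [x + TAU * x_dot, x_dot + TAU * x_acc,
            theta + TAU * theta_dot, theta_dot + TAU * theta_acc]


def run_episode(policy, rng):
    """Run one episode and return the total reward (+1 per step taken)."""
    state = [rng.uniform(-0.05, 0.05) for _ in range(4)]
    total = 0
    for _ in range(MAX_STEPS):
        state = step(state, policy(state, rng))
        total += 1
        if abs(state[0]) > X_LIMIT or abs(state[2]) > THETA_LIMIT:
            break
    return total


def random_action_policy(state, rng):
    """Ignore the observation entirely."""
    return rng.randrange(2)


def make_linear_policy(rng):
    """Build a policy from one random 4 x 2 weight matrix (no selection of the best)."""
    weight = [[rng.random() for _ in range(2)] for _ in range(4)]

    def policy(state, _rng):
        scores = [sum(s * row[a] for s, row in zip(state, weight)) for a in range(2)]
        return 0 if scores[0] >= scores[1] else 1

    return policy


rng = random.Random(0)
n_episodes = 500
random_returns = [run_episode(random_action_policy, rng) for _ in range(n_episodes)]
linear_returns = [run_episode(make_linear_policy(rng), rng) for _ in range(n_episodes)]
avg_random = sum(random_returns) / n_episodes
avg_linear = sum(linear_returns) / n_episodes
print(f"random actions:         {avg_random:.1f}")
print(f"random linear mappings: {avg_linear:.1f}")
```

Each episode in the second batch uses a fresh random weight matrix, so the average reflects untuned linear mappings rather than the best one found by the search.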