- Python Reinforcement Learning
- Sudharsan Ravichandiran Sean Saito Rajalingappaa Shanmugamani Yang Wenzhuo
- 112字
- 2021-06-24 15:17:21
RL algorithm
The steps involved in typical RL algorithm are as follows:
- First, the agent interacts with the environment by performing an action
- The agent performs an action and moves from one state to another
- And then the agent will receive a reward based on the action it performed
- Based on the reward, the agent will understand whether the action was good or bad
- If the action was good, that is, if the agent received a positive reward, then the agent will prefer performing that action or else the agent will try performing an other action which results in a positive reward. So it is basically a trial and error learning process
推薦閱讀
- 數據挖掘原理與實踐
- Hands-On Machine Learning with Microsoft Excel 2019
- Java Data Science Cookbook
- 文本數據挖掘:基于R語言
- Oracle高性能自動化運維
- 計算機應用基礎教程上機指導與習題集(微課版)
- Construct 2 Game Development by Example
- Mastering LOB Development for Silverlight 5:A Case Study in Action
- 改變未來的九大算法
- 菜鳥學SPSS數據分析
- 大數據技術原理與應用:概念、存儲、處理、分析與應用
- Access 2010數據庫程序設計實踐教程
- 云計算
- 數據賦能
- 算力經濟:從超級計算到云計算