- Hands-On Reinforcement Learning with Python
- Sudharsan Ravichandiran
- 112字
- 2021-06-18 19:12:01
RL algorithm
The steps involved in typical RL algorithm are as follows:
- First, the agent interacts with the environment by performing an action
- The agent performs an action and moves from one state to another
- And then the agent will receive a reward based on the action it performed
- Based on the reward, the agent will understand whether the action was good or bad
- If the action was good, that is, if the agent received a positive reward, then the agent will prefer performing that action or else the agent will try performing an other action which results in a positive reward. So it is basically a trial and error learning process
推薦閱讀
- ASP.NET Core:Cloud-ready,Enterprise Web Application Development
- Vue.js設(shè)計(jì)與實(shí)現(xiàn)
- JavaScript百煉成仙
- Spring 5.0 Microservices(Second Edition)
- Rust實(shí)戰(zhàn)
- 面向STEM的Scratch創(chuàng)新課程
- Swift 3 New Features
- C語言從入門到精通(第4版)
- PostgreSQL 11從入門到精通(視頻教學(xué)版)
- Scala謎題
- Ext JS 4 Web Application Development Cookbook
- Windows Phone 7.5:Building Location-aware Applications
- MySQL程序員面試筆試寶典
- Illustrator CC平面設(shè)計(jì)實(shí)戰(zhàn)從入門到精通(視頻自學(xué)全彩版)
- IPython Interactive Computing and Visualization Cookbook