- Hands-On Q-Learning with Python
- Nazia Habib
- 61字
- 2021-06-24 15:13:14
Demystifying MDPs
The technical purpose of Q-learning is to discover solutions for a type of optimization problem called an MDP.
When we talk about states and the actions that we can take from states, we are discussing concepts developed in the context of MDPs (and the Markov chains and other state space models that they are derived from).
推薦閱讀
- Design for the Future
- 蕩胸生層云:C語言開發(fā)修行實錄
- WOW!Illustrator CS6完全自學寶典
- Hands-On Cloud Solutions with Azure
- 人工智能工程化:應用落地與中臺構(gòu)建
- Zabbix Network Monitoring(Second Edition)
- Blender Compositing and Post Processing
- 中國戰(zhàn)略性新興產(chǎn)業(yè)研究與發(fā)展·智能制造
- 氣動系統(tǒng)裝調(diào)與PLC控制
- RedHat Linux用戶基礎
- Hands-On Data Warehousing with Azure Data Factory
- 智能鼠原理與制作(進階篇)
- Access 2007數(shù)據(jù)庫入門與實例應用金典
- ARM體系結(jié)構(gòu)與編程
- Microsoft Office 365:Exchange Online Implementation and Migration(Second Edition)