書名: Python Reinforcement Learning作者名: Sudharsan Ravichandiran Sean Saito Rajalingappaa Shanmugamani Yang Wenzhuo本章字數: 71字更新時間: 2021-06-24 15:17:34
Questions
The question list is as follows:
- What is the Markov property?
- Why do we need the Markov Decision Process?
- When do we prefer immediate rewards?
- What is the use of the discount factor?
- Why do we use the Bellman function?
- How would you derive the Bellman equation for a Q function?
- How are the value function and Q function related?
- What is the difference between value iteration and policy iteration?
推薦閱讀
- Greenplum:從大數據戰略到實現
- 數據挖掘原理與實踐
- Python絕技:運用Python成為頂級數據工程師
- 信息系統與數據科學
- 劍破冰山:Oracle開發藝術
- Voice Application Development for Android
- 計算機信息技術基礎實驗與習題
- SQL查詢:從入門到實踐(第4版)
- Hadoop大數據實戰權威指南(第2版)
- Starling Game Development Essentials
- 基于OPAC日志的高校圖書館用戶信息需求與檢索行為研究
- Python數據分析與挖掘實戰(第3版)
- Instant Autodesk AutoCAD 2014 Customization with .NET
- Splunk智能運維實戰
- Scratch 2.0 Game Development HOTSHOT