- TensorFlow Reinforcement Learning Quick Start Guide
- Kaushik Balakrishnan
- 35字
- 2021-06-24 15:29:07
On-policy versus off-policy learning
RL algorithms can be classified as on-policy or off-policy. We will now learn about both of these classes and how to distinguish a given RL algorithm into one or the other.
推薦閱讀
- AutoCAD繪圖實用速查通典
- 21天學(xué)通JavaScript
- 空間機器人遙操作系統(tǒng)及控制
- Hands-On Data Science with SQL Server 2017
- 機艙監(jiān)測與主機遙控
- 計算機網(wǎng)絡(luò)技術(shù)基礎(chǔ)
- 網(wǎng)絡(luò)組建與互聯(lián)
- 步步圖解自動化綜合技能
- 傳感器與新聞
- Machine Learning with Apache Spark Quick Start Guide
- Statistics for Data Science
- 貫通Java Web輕量級應(yīng)用開發(fā)
- 算法設(shè)計與分析
- 樂高創(chuàng)意機器人教程(中級 上冊 10~16歲) (青少年iCAN+創(chuàng)新創(chuàng)意實踐指導(dǎo)叢書)
- 計算機硬件技術(shù)基礎(chǔ)學(xué)習(xí)指導(dǎo)與練習(xí)