Chapter 11: Policy Gradients and Optimization
- Python Reinforcement Learning
- Sudharsan Ravichandiran Sean Saito Rajalingappaa Shanmugamani Yang Wenzhuo
- 147字
- 2021-06-24 15:18:32
上QQ閱讀APP看后續(xù)精彩內(nèi)容
登錄訂閱本章 >
推薦閱讀
- 輕松學(xué)大數(shù)據(jù)挖掘:算法、場景與數(shù)據(jù)產(chǎn)品
- 數(shù)據(jù)化網(wǎng)站運營深度剖析
- Neural Network Programming with TensorFlow
- 大數(shù)據(jù)時代下的智能轉(zhuǎn)型進(jìn)程精選(套裝共10冊)
- 3D計算機視覺:原理、算法及應(yīng)用
- 城市計算
- Starling Game Development Essentials
- 大數(shù)據(jù)架構(gòu)商業(yè)之路:從業(yè)務(wù)需求到技術(shù)方案
- Solaris操作系統(tǒng)原理實驗教程
- MySQL技術(shù)內(nèi)幕:SQL編程
- Augmented Reality using Appcelerator Titanium Starter
- 數(shù)據(jù)庫與數(shù)據(jù)處理:Access 2010實現(xiàn)
- Node.js High Performance
- Deep Learning with R for Beginners
- PostgreSQL高可用實戰(zhàn)