Deep deterministic policy gradient
- Python Reinforcement Learning
- Sudharsan Ravichandiran Sean Saito Rajalingappaa Shanmugamani Yang Wenzhuo
- 684字
- 2021-06-24 15:17:57
上QQ閱讀APP看后續(xù)精彩內(nèi)容
登錄訂閱本章 >
推薦閱讀
- 企業(yè)數(shù)字化創(chuàng)新引擎:企業(yè)級(jí)PaaS平臺(tái)HZERO
- ETL數(shù)據(jù)整合與處理(Kettle)
- 數(shù)據(jù)庫(kù)技術(shù)與應(yīng)用教程(Access)
- Test-Driven Development with Mockito
- 商業(yè)分析思維與實(shí)踐:用數(shù)據(jù)分析解決商業(yè)問題
- 深度剖析Hadoop HDFS
- 數(shù)據(jù)架構(gòu)與商業(yè)智能
- Microsoft Power BI數(shù)據(jù)可視化與數(shù)據(jù)分析
- 數(shù)據(jù)分析師養(yǎng)成寶典
- 區(qū)域云計(jì)算和大數(shù)據(jù)產(chǎn)業(yè)發(fā)展:浙江樣板
- 爬蟲實(shí)戰(zhàn):從數(shù)據(jù)到產(chǎn)品
- Mastering ROS for Robotics Programming(Second Edition)
- Spring Boot 2.0 Cookbook(Second Edition)
- 企業(yè)大數(shù)據(jù)處理:Spark、Druid、Flume與Kafka應(yīng)用實(shí)踐
- 從Lucene到Elasticsearch:全文檢索實(shí)戰(zhàn)