Monte Carlo policy gradients with REINFORCE
- Python Deep Learning
- Ivan Vasilev Daniel Slater Gianmario Spacagna Peter Roelants Valentino Zocca
- 379字
- 2021-07-02 14:31:40
上QQ閱讀APP看后續(xù)精彩內容
登錄訂閱本章 >
推薦閱讀
- 深度實踐OpenStack:基于Python的OpenStack組件開發(fā)
- 測試驅動開發(fā):入門、實戰(zhàn)與進階
- SpringMVC+MyBatis快速開發(fā)與項目實戰(zhàn)
- HTML5 Mobile Development Cookbook
- Microsoft System Center Orchestrator 2012 R2 Essentials
- Python算法從菜鳥到達人
- Python算法指南:程序員經(jīng)典算法分析與實現(xiàn)
- Building Machine Learning Systems with Python(Second Edition)
- Python計算機視覺和自然語言處理
- Practical GIS
- Go語言從入門到精通
- 數(shù)據(jù)分析與挖掘算法:Python實戰(zhàn)
- WCF技術剖析(卷1)
- 數(shù)據(jù)結構與算法詳解
- OpenStack Sahara Essentials