- Reinforcement Learning with TensorFlow
- Sayon Dutta
- 173字
- 2021-08-27 18:51:59
Summary
In this chapter, we covered the building blocks, such as shallow and deep neural networks that included logistic regression, single hidden layer neural network, RNNs, LSTMs, CNNs, and their other variations. Catering to the these topics, we also covered multiple activation functions, how forward and backward propagation works, and the problems associated with the training of deep neural networks, such as vanishing and exploding gradients.
Then, we covered the very basic terminologies in reinforcement learning that we will explore in detail in the coming chapters. These were the optimality criteria, which are value function and policy. We also gained an understanding of some reinforcement learning algorithms, such as Q-learning and A3C algorithms. Then, we covered some basic computations in the TensorFlow framework, an introduction to OpenAI Gym, and also discussed some of the influential pioneers and research breakthroughs in the field of reinforcement learning.
In the following chapter, we will implement a basic reinforcement learning algorithm to a couple of OpenAI Gym framework environments and get a better understanding of OpenAI Gym.
- AutoCAD快速入門與工程制圖
- 21小時學(xué)通AutoCAD
- Python Artificial Intelligence Projects for Beginners
- 精通Windows Vista必讀
- 大數(shù)據(jù)時代的數(shù)據(jù)挖掘
- 精通Excel VBA
- 現(xiàn)代傳感技術(shù)
- 精通數(shù)據(jù)科學(xué)算法
- Visual FoxPro數(shù)據(jù)庫基礎(chǔ)及應(yīng)用
- 大數(shù)據(jù)驅(qū)動的機(jī)械裝備智能運(yùn)維理論及應(yīng)用
- 貫通Java Web開發(fā)三劍客
- Salesforce for Beginners
- 電氣控制與PLC原理及應(yīng)用(歐姆龍機(jī)型)
- 在實(shí)戰(zhàn)中成長:C++開發(fā)之路
- 基于Proteus的單片機(jī)應(yīng)用技術(shù)