官术网_书友最值得收藏!

Summary

In this chapter, we covered the building blocks, such as shallow and deep neural networks that included logistic regression, single hidden layer neural network, RNNs, LSTMs, CNNs, and their other variations. Catering to the these topics, we also covered multiple activation functions, how forward and backward propagation works, and the problems associated with the training of deep neural networks, such as vanishing and exploding gradients.

Then, we covered the very basic terminologies in reinforcement learning that we will explore in detail in the coming chapters. These were the optimality criteria, which are value function and policy. We also gained an understanding of some reinforcement learning algorithms, such as Q-learning and A3C algorithms. Then, we covered some basic computations in the TensorFlow framework, an introduction to OpenAI Gym, and also discussed some of the influential pioneers and research breakthroughs in the field of reinforcement learning.

In the following chapter, we will implement a basic reinforcement learning algorithm to a couple of OpenAI Gym framework environments and get a better understanding of OpenAI Gym.

主站蜘蛛池模板: 遂川县| 洛阳市| 洛阳市| 蕲春县| 咸阳市| 乡城县| 屏南县| 仲巴县| 兰溪市| 嵩明县| 磴口县| 安多县| 中阳县| 三明市| 舟山市| 克什克腾旗| 资兴市| 和静县| 林周县| 体育| 册亨县| 长泰县| 嘉荫县| 长治县| 宁明县| 军事| 新巴尔虎左旗| 禹州市| 九江县| 宁陵县| 青海省| 浮梁县| 伊川县| 祁东县| 修武县| 陆丰市| 什邡市| 利川市| 上思县| 衢州市| 姜堰市|