
Summary

In this chapter, we covered the basic building blocks: shallow and deep neural networks, including logistic regression, single-hidden-layer neural networks, RNNs, LSTMs, CNNs, and their variations. Alongside these topics, we covered several activation functions, how forward and backward propagation work, and the problems associated with training deep neural networks, such as vanishing and exploding gradients.

Then, we covered the basic terminology of reinforcement learning, which we will explore in detail in the coming chapters. This included the optimality criteria, namely the value function and the policy. We also gained an understanding of some reinforcement learning algorithms, such as Q-learning and A3C. Finally, we covered some basic computations in the TensorFlow framework, introduced OpenAI Gym, and discussed some of the influential pioneers and research breakthroughs in the field of reinforcement learning.

In the following chapter, we will apply a basic reinforcement learning algorithm to a couple of OpenAI Gym environments and get a better understanding of OpenAI Gym.
