官术网_书友最值得收藏!

Deep Q-learning

Deep Q-learning represents an evolution of the basic Q-learning method the state-action is replaced by a neural network, with the aim of approximating the optimal value function.

Compared to the previous approaches, where it was used to structure the network in order to request both input and action and providing its expected return, Deep Q-learning revolutionizes the structure in order to request only the state of the environment and supply as many status-action values as there are actions that can be performed in the environment.

主站蜘蛛池模板: 怀集县| 那曲县| 驻马店市| 三河市| 长岛县| 信阳市| 旅游| 喀什市| 凤山县| 新河县| 海南省| 肇东市| 宿松县| 资讯 | 屏东县| 伊川县| 星子县| 雅江县| 固原市| 塔河县| 广平县| 南昌市| 哈巴河县| 高淳县| 赞皇县| 越西县| 临泉县| 福泉市| 盐亭县| 嘉祥县| 永修县| 玛纳斯县| 天柱县| 丰顺县| 通辽市| 崇信县| 龙川县| 宁安市| 海城市| 嘉峪关市| 汤阴县|