官术网_书友最值得收藏!

Deep Q-learning

Deep Q-learning represents an evolution of the basic Q-learning method the state-action is replaced by a neural network, with the aim of approximating the optimal value function.

Compared to the previous approaches, where it was used to structure the network in order to request both input and action and providing its expected return, Deep Q-learning revolutionizes the structure in order to request only the state of the environment and supply as many status-action values as there are actions that can be performed in the environment.

主站蜘蛛池模板: 大名县| 银川市| 全州县| 和静县| 芒康县| 蓝田县| 金川县| 云阳县| 沂南县| 西乌珠穆沁旗| 保山市| 西安市| 霍山县| 精河县| 浙江省| 南充市| 镇原县| 金昌市| 宁都县| 海淀区| 新昌县| 桂阳县| 册亨县| 确山县| 土默特左旗| 黎川县| 安福县| 西安市| 瓮安县| 南平市| 青浦区| 古田县| 弥勒县| 涟源市| 盐山县| 虞城县| 土默特右旗| 阜康市| 郧西县| 徐闻县| 东港市|