Deep Q-learning

Deep Q-learning is an evolution of the basic Q-learning method in which the state-action table is replaced by a neural network, with the aim of approximating the optimal value function.

In previous approaches, the network was structured to take both a state and an action as input and to return the expected return for that pair. Deep Q-learning changes this structure: the network takes only the state of the environment as input and outputs as many state-action values as there are actions that can be performed in the environment.
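The input/output shape described above can be illustrated with a minimal sketch in plain Python. It is not a trained agent and not a definitive implementation: the layer sizes, the ReLU activation, the random weights, and the helper names (`make_q_network`, `q_values`, `greedy_action`) are all assumptions chosen for illustration. The point is only that a single forward pass on a state yields one value per action.

```python
import random


def make_q_network(state_dim, hidden_dim, n_actions, seed=0):
    """Build random weights for a tiny two-layer network (hypothetical sizes)."""
    rng = random.Random(seed)

    def layer(n_in, n_out):
        weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                   for _ in range(n_out)]
        biases = [0.0] * n_out
        return weights, biases

    return {
        "hidden": layer(state_dim, hidden_dim),
        "output": layer(hidden_dim, n_actions),
    }


def dense(inputs, weights, biases):
    """Fully connected layer: one weighted sum plus bias per output unit."""
    return [sum(w * x for w, x in zip(row, inputs)) + b
            for row, b in zip(weights, biases)]


def q_values(network, state):
    """Single forward pass: state in, one Q-value per action out."""
    hidden = [max(0.0, h) for h in dense(state, *network["hidden"])]  # ReLU
    return dense(hidden, *network["output"])


def greedy_action(network, state):
    """Pick the index of the action with the largest estimated value."""
    values = q_values(network, state)
    return max(range(len(values)), key=values.__getitem__)


# Assumed toy environment: 4 state features, 2 possible actions.
network = make_q_network(state_dim=4, hidden_dim=8, n_actions=2)
state = [0.1, -0.2, 0.05, 0.3]
values = q_values(network, state)
action = greedy_action(network, state)
```

Because the action is not part of the input, choosing the best action needs only one forward pass followed by a maximum over the outputs, rather than one pass per candidate action.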
