官术网_书友最值得收藏!

Understanding SARSA and Q-Learning 

In this section, we will learn about SARSA and Q-Learning and how can they are coded with Python. Before we go further, let's find out what SARSA and Q-Learning are. SARSA is an algorithm that uses the state-action Q values to update. These concepts are derived from the computer science field of dynamic programming, while Q-learning is an off-policy algorithm that was first proposed by Christopher Watkins in 1989, and is a widely used RL algorithm. 

主站蜘蛛池模板: 天镇县| 牡丹江市| 班玛县| 涞水县| 乐业县| 恭城| 汨罗市| 辽中县| 奇台县| 大渡口区| 高邮市| 呼伦贝尔市| 平果县| 保康县| 襄汾县| 东安县| 双牌县| 那曲县| 夏邑县| 营山县| 澄江县| 锡林浩特市| 蓬安县| 旌德县| 尤溪县| 陵川县| 嘉荫县| 元氏县| 沁源县| 惠水县| 安多县| 开阳县| 昭平县| 隆子县| 宣威市| 庄河市| 玛多县| 东乡族自治县| 郎溪县| 南投市| 栾城县|