官术网_书友最值得收藏!

書名： TensorFlow Reinforcement Learning Quick Start Guide
作者名： Kaushik Balakrishnan
本章字數： 60字
更新時間： 2021-06-24 15:29:08

Questions

Is a replay buffer required for on-policy or off-policy RL algorithms?
Why do we discount rewards?
What will happen if the discount factor is γ > 1?
Will a model-based RL agent always perform better than a model-free RL agent, since we have a model of the environment states?
What is the difference between RL and deep RL?

主站蜘蛛池模板：定州市| 丰县| 潢川县| 沧州市| 四会市| 临武县| 德江县| 阳原县| 旬邑县| 陈巴尔虎旗| 平和县| 丹凤县| 罗平县| 丰城市| 沛县| 天台县| 丰顺县| 华坪县| 安达市| 婺源县| 民权县| 山东省| 兖州市| 靖州| 卢氏县| 江阴市| 兴和县| 柳江县| 独山县| 岗巴县| 陆河县| 鹤壁市| 绥棱县| 兴仁县| 庆安县| 灵台县| 西乌珠穆沁旗| 错那县| 江川县| 郴州市| 黄浦区|