官术网_书友最值得收藏!

  • Python Reinforcement Learning
  • Sudharsan Ravichandiran Sean Saito Rajalingappaa Shanmugamani Yang Wenzhuo
  • 96字
  • 2021-06-24 15:17:33

Dynamic programming

Dynamic programming (DP) is a technique for solving complex problems. In DP, instead of solving complex problems one at a time, we break the problem into simple sub-problems, then for each sub-problem, we compute and store the solution. If the same sub-problem occurs, we will not recompute, instead, we use the already computed solution. Thus, DP helps in drastically minimizing the computation time. It has its applications in a wide variety of fields including computer science, mathematics, bioinformatics, and so on.

We solve a Bellman equation using two powerful algorithms:

  • Value iteration
  • Policy iteration
主站蜘蛛池模板: 曲水县| 定日县| 乐亭县| 饶阳县| 桃江县| 错那县| 昌都县| 贵德县| 平湖市| 蓬莱市| 前郭尔| 宣化县| 迭部县| 亚东县| 丘北县| 宜兰县| 林甸县| 和硕县| 谢通门县| 雅江县| 宜城市| 丰原市| 彭阳县| 枞阳县| 道孚县| 昭苏县| 白山市| 吴川市| 商洛市| 大兴区| 山西省| 巨鹿县| 许昌县| 平潭县| 平和县| 建始县| 忻城县| 郁南县| 贵定县| 会东县| 平塘县|