Title: Python Reinforcement Learning
Authors: Sudharsan Ravichandiran, Sean Saito, Rajalingappaa Shanmugamani, Yang Wenzhuo
Updated: 2021-06-24 15:17:33

Solving the Bellman equation

We can find an optimal policy by solving the Bellman optimality equation. Because this equation defines the optimal value of each state in terms of the optimal values of its successor states, it is usually solved iteratively, using a technique called dynamic programming.
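As a rough illustration of how dynamic programming can solve the Bellman optimality equation, the sketch below applies value iteration, which is one common dynamic programming method and is chosen here as an assumption rather than taken from the text. The two-state MDP, the arrays P and R, the discount factor gamma, and the function name value_iteration are all hypothetical and exist only for this example.

```python
import numpy as np


def value_iteration(P, R, gamma=0.9, tol=1e-8, max_iter=10_000):
    """Approximate the optimal state values of a small, fully known MDP.

    P has shape (actions, states, states): P[a, s, t] is the probability
    of moving from state s to state t under action a.
    R has shape (states, actions): R[s, a] is the expected immediate reward.
    """
    V = np.zeros(R.shape[0])
    for _ in range(max_iter):
        # Bellman optimality backup:
        # Q[s, a] = R[s, a] + gamma * sum_t P[a, s, t] * V[t]
        Q = R + gamma * np.einsum("ast,t->sa", P, V)
        V_new = Q.max(axis=1)
        converged = np.max(np.abs(V_new - V)) < tol
        V = V_new
        if converged:
            break
    # Read off a greedy policy from the final value estimates.
    Q = R + gamma * np.einsum("ast,t->sa", P, V)
    return V, Q.argmax(axis=1)


# Hypothetical MDP: action 0 stays in place, action 1 moves to the other state.
P = np.array([
    [[1.0, 0.0], [0.0, 1.0]],  # action 0: stay
    [[0.0, 1.0], [1.0, 0.0]],  # action 1: switch state
])
# Only staying in state 1 yields a reward.
R = np.array([
    [0.0, 0.0],  # state 0: stay, switch
    [1.0, 0.0],  # state 1: stay, switch
])

V, policy = value_iteration(P, R)
print(V, policy)
```

For this toy problem the values should settle close to 9 for state 0 and 10 for state 1, with the greedy policy switching out of state 0 and staying in state 1. The method assumes the transition probabilities and rewards are fully known, which is what makes a dynamic programming approach applicable.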
