πŸ“š Node [[bellman_equation]] exact match β˜†
Nodes contain individual contributions whose filenames match your search. x

Bellman equation

Go back to the [[AI Glossary]]

#rl

In reinforcement learning, the following identity satisfied by the optimal Q-function:

The Q-function in reinforcement learning

Reinforcement learning algorithms apply this identity to create Q-learning via the following update rule:

The Bellman equation

Beyond reinforcement learning, the Bellman equation has applications to dynamic programming. See the Wikipedia entry for Bellman Equation.

Loading pushes...

Rendering context...