Go back to the [[AI Glossary]]
#rl
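In reinforcement learning, the Bellman equation is the following identity, satisfied by the optimal Q-function:

$$Q(s, a) = r(s, a) + \gamma \, \mathbb{E}_{s' \mid s, a}\left[\max_{a'} Q(s', a')\right]$$

Here $r(s, a)$ is the reward for taking action $a$ in state $s$, $\gamma$ is the discount factor, and $s'$ is the next state.
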
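Reinforcement learning algorithms apply this identity to create Q-learning via the following update rule:

$$Q(s, a) \leftarrow Q(s, a) + \alpha \left[ r(s, a) + \gamma \max_{a'} Q(s', a') - Q(s, a) \right]$$

Here $\alpha$ is the learning rate, and $s'$ is the next state observed after taking action $a$ in state $s$.
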
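As a rough illustration, the Q-learning update can be sketched in tabular form. This is a minimal sketch, not a reference implementation: the function name `q_learning_update`, the dictionary-based Q-table, and the default values for the learning rate `alpha` and discount factor `gamma` are all assumptions made for the example.

```python
from collections import defaultdict


def q_learning_update(q, state, action, reward, next_state, next_actions,
                      alpha=0.1, gamma=0.9):
    """Apply one tabular Q-learning update in place; return the new Q(state, action).

    q is a hypothetical Q-table mapping (state, action) pairs to floats.
    """
    # Greedy value of the next state; 0.0 when there are no next actions
    # (treated here as a terminal state).
    best_next = max((q[(next_state, a)] for a in next_actions), default=0.0)
    # Target taken from the right-hand side of the Bellman equation,
    # using the single observed next state in place of the expectation.
    td_target = reward + gamma * best_next
    # Move the current estimate a fraction alpha toward the target.
    q[(state, action)] += alpha * (td_target - q[(state, action)])
    return q[(state, action)]


# Usage: unseen (state, action) pairs start at 0.0.
q = defaultdict(float)
new_value = q_learning_update(q, "s0", "right", 1.0, "s1", ["left", "right"],
                              alpha=0.5, gamma=0.9)
```

The sampled next state stands in for the expectation over next states, which is why repeated updates are needed for the estimate to approach the value satisfying the identity.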
Beyond reinforcement learning, the Bellman equation has applications to dynamic programming. See the Wikipedia entry for Bellman Equation.