Go back to the [[AI Glossary]]
#rl
In reinforcement learning, the entity that uses a policy to maximize expected return gained from transitioning between states of the environment.
Expanding this section will automatically generate an AI synthesis of the contributions in this node.