📕 Node [[return]]

📄 Return.md by @KGBicheno

return

Go back to the [[AI Glossary]]

In reinforcement learning, given a certain policy and a certain state, the return is the sum of all rewards that the agent expects to receive when following the policy from the state to the end of the episode. The agent accounts for the delayed nature of expected rewards by discounting rewards according to the state transitions required to obtain the reward.

$$\text {Therefore, if the discount factor is } \lambda \text{, and } r_o,\ldots,r_n \text{ denote the rewards until the end of the episode, then the return calculation is as follows:} $$ The equation for return

Loading pushes...

Rendering context...

📕 Node [[2003 12 17 return of the awesome]] perhaps related

📕 Node [[2006 06 11 socialtext is going open source the return of jluster]] perhaps related

📕 Node [[20200607142108 roberts_returning_to_normal]] perhaps related

📕 Node [[emotionally charged tax return]] perhaps related

📕 Node [[extreme heat in the worlds oceans passed the point of no return in 2014]] perhaps related

📕 Node [[online platforms should return value to the citizen body]] perhaps related

📕 Node [[return to monkey island]] perhaps related

📕 Node [[returning to monkey island]] perhaps related

📕 Node [[returning to normal]] perhaps related

📕 Node [[tax return]] perhaps related

📕 Node [[the return of the no budget film contest]] perhaps related