Return and value function definitions for RL (pages 680-683) #47
labdmitriy
Hi Sebastian,
On page 680, the return is defined as a sum in which each reward R(t+k+1) is multiplied by gamma^k.
But in the definitions of the value and action-value functions on pages 682-683, the same reward R(t+k+1) is multiplied by gamma^(k+1).
Why do the latter definitions use the multiplier gamma^(k+1) instead of gamma^k?
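To make the discrepancy concrete, here is a minimal sketch comparing the two multipliers. The reward sequence, the value of gamma, and the helper name `discounted_sum` are made up for illustration and are not taken from the book. The sketch also uses a single fixed reward sequence, so the expectation in the value function definitions is left out.

```python
def discounted_sum(rewards, gamma, exponent_offset=0):
    """Return the sum of gamma**(k + exponent_offset) * rewards[k] for k = 0, 1, 2, ...

    rewards[k] plays the role of R(t+k+1).
    """
    return sum(gamma ** (k + exponent_offset) * r for k, r in enumerate(rewards))


# Hypothetical rewards R(t+1), R(t+2), R(t+3), chosen only for illustration.
rewards = [1.0, 2.0, 3.0]
gamma = 0.5

# Multiplier gamma^k, as in the return definition on page 680.
sum_gamma_k = discounted_sum(rewards, gamma, exponent_offset=0)

# Multiplier gamma^(k+1), as in the definitions on pages 682-683.
sum_gamma_k_plus_1 = discounted_sum(rewards, gamma, exponent_offset=1)

print(sum_gamma_k)         # 2.75
print(sum_gamma_k_plus_1)  # 1.375
```

With these numbers the second sum is exactly gamma times the first, so the two forms differ by a constant factor of gamma and cannot both describe the same quantity.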
Thank you.