reinforcejs

GridWorld: TD, Demo Page: Discounted Reward greater than 1.0?

Open TheMen4ce opened this issue 4 years ago • 0 comments

Using the initial settings, how can the discounted reward of the center field be 1.1? The maximum reward the agent can get is 1.0; after that the goal is reached and the agent is reset.

Also, if I change the field below to R 1.0, I'd expect the discounted reward to be 10 instead of 9.9: image

and here I'd expect 50 instead of 49.90: image
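For reference, here is a minimal sketch of the arithmetic behind those expected values. It assumes the agent collects a reward of 1.0 on every step indefinitely, and that the discount factor is 0.9 in the first screenshot and 0.98 in the second. Neither value is stated above; they are inferred from the expected results of 10 and 50. The function names are made up for illustration and are not part of reinforcejs.

```python
def discounted_return(reward, gamma, steps):
    """Truncated discounted sum: reward * gamma**k for k = 0 .. steps - 1."""
    return sum(reward * gamma**k for k in range(steps))


def infinite_horizon_return(reward, gamma):
    """Closed form of the geometric series reward / (1 - gamma), valid for 0 <= gamma < 1."""
    if not 0.0 <= gamma < 1.0:
        raise ValueError("gamma must be in [0, 1)")
    return reward / (1.0 - gamma)


if __name__ == "__main__":
    # Assumed discount factors (not stated in the issue).
    for gamma in (0.9, 0.98):
        closed_form = infinite_horizon_return(1.0, gamma)
        truncated = discounted_return(1.0, gamma, 2000)
        print(f"gamma={gamma}: closed form ~ {closed_form:.4f}, 2000-step sum ~ {truncated:.4f}")
```

Under these assumptions the series comes to roughly 10 and 50, which is where my expectation comes from. It does not explain the 9.9 and 49.90 shown in the demo.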

TheMen4ce, May 05 '21 15:05