Unamu7simure
Results
1
issues of
Unamu7simure
Q-learning formula (18.3.10) seems to be only for non-terminal states. If St is one of terminal states (gold or traps), Q table should not be renewed and should keep the...