Unamu7simure

Results: 1 issue by Unamu7simure

The Q-learning formula (18.3.10) seems to apply only to non-terminal states. If S_t is one of the terminal states (gold or traps), the Q table should not be updated and should keep the...
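To make the point concrete, here is a minimal sketch of a tabular Q-learning step that skips terminal states. It assumes formula (18.3.10) is the standard update Q(s, a) ← Q(s, a) + α (r + γ max Q(s', ·) − Q(s, a)), which is not shown in this listing. The function name `q_update`, the `terminal_states` set, and the values of `alpha` and `gamma` are hypothetical and chosen only for illustration.

```python
import numpy as np

def q_update(Q, s, a, r, s_next, terminal_states, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step that leaves terminal-state rows untouched.

    Assumption: states and actions are integer indices into the 2-D array Q,
    and terminal_states is a set of state indices (e.g. gold or trap cells).
    """
    if s in terminal_states:
        # The episode has ended in s, so no action is taken from it
        # and its row keeps its current (initial) values.
        return Q
    # A terminal next state has no future return to bootstrap from.
    next_value = 0.0 if s_next in terminal_states else float(np.max(Q[s_next]))
    Q[s, a] += alpha * (r + gamma * next_value - Q[s, a])
    return Q

# Hypothetical 4-state, 2-action problem in which state 3 is terminal.
Q = np.zeros((4, 2))
terminal = {3}
q_update(Q, s=2, a=1, r=1.0, s_next=3, terminal_states=terminal)  # updates Q[2, 1]
q_update(Q, s=3, a=0, r=5.0, s_next=0, terminal_states=terminal)  # leaves row 3 alone
```

With this guard, the entries for terminal states stay at their initial values, and a transition into a terminal state uses only the immediate reward as its target.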