DQN-tensorflow
DQN-tensorflow copied to clipboard
Possible bug of using numpy randint
Code of line 62 in replay_memory.py: index = random.randint(self.history_length, self.count - 1) should change to index = random.randint(self.history_length, self.count) Correct ?