4359hh

Results 3 issues of 4359hh

Hello, Markov property means that the current action of the agent is only related to the current state s_t, but the input of the policy network in your open-source code...

def optimize(self): err = optimize(self.frames, self.points, local_window, fix_points, verbose, rounds) ... ???it puzzles me thanks in advance