Deep_reinforcement_learning_Course
Deep_reinforcement_learning_Course copied to clipboard
bug in space invaders
the line: self.Q = tf.reduce_sum(tf.multiply(self.output, self.actions_))
should be: self.Q = tf.reduce_sum(tf.multiply(self.output, self.actions_), axis=1)