RLMilestone

Results: 1 issue by RLMilestone

```
if super(Supervisor, self).step(self.timestep) == -1:
    exit()

self.apply_action(action)
return (
    self.get_observations(),
    self.get_reward(action),
    self.is_done(),
    self.get_info(),
)
```

In RL, it seems to be more natural to call `apply_action` first and then `Supervisor.step()`. Otherwise,...
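The ordering proposed in the issue (apply the action, advance the simulation, then observe) might look roughly like the sketch below. This is only an illustration, not the project's actual implementation: `StubSupervisor` is a hypothetical stand-in for the simulator's `Supervisor` class so the snippet runs without a simulator, and `ApplyThenStepEnv`, the call log, the default `timestep`, and the placeholder return values are all assumptions made for the example.

```python
class StubSupervisor:
    """Hypothetical stand-in for the simulator's Supervisor; it only records calls."""

    def __init__(self):
        self.log = []

    def step(self, timestep):
        self.log.append("step")
        return 0  # the snippet in the issue treats -1 as "simulation ended"


class ApplyThenStepEnv(StubSupervisor):
    """Hypothetical environment using the apply-then-step ordering."""

    def __init__(self, timestep=32):
        super().__init__()
        self.timestep = timestep

    def apply_action(self, action):
        self.log.append(f"apply_action({action})")

    def get_observations(self):
        self.log.append("get_observations")
        return [0.0]  # placeholder observation

    def get_reward(self, action):
        return 0.0  # placeholder reward

    def is_done(self):
        return False

    def get_info(self):
        return {}

    def step(self, action):
        # 1. Send the action to the robot first.
        self.apply_action(action)
        # 2. Advance the simulation so the action can take effect.
        if super().step(self.timestep) == -1:
            raise SystemExit
        # 3. Observe the state that results from the action.
        return (
            self.get_observations(),
            self.get_reward(action),
            self.is_done(),
            self.get_info(),
        )


env = ApplyThenStepEnv()
result = env.step(1)
print(env.log)
```

With this ordering, the observation and reward returned by `step(action)` are computed after the simulation has advanced with that action applied, which matches the usual RL convention that `step` returns the consequence of the action it was given.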

refactor