JesseLT

Results 3 comments of JesseLT

The agent sees the next state only when `step(action)` returns the `state`, not at the moment `self._update_state` is executed.
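A minimal sketch of what I mean, assuming a toy environment. `ToyEnv`, its integer state, and the reward/done logic are hypothetical and only for illustration; they are not the project's actual code. Only the names `step` and `_update_state` come from the discussion above.

```python
class ToyEnv:
    """Hypothetical toy environment, not the project's real class."""

    def __init__(self):
        self.state = 0

    def _update_state(self, action):
        # Internal bookkeeping: the state changes here,
        # but nothing has been handed to the agent yet.
        self.state += action

    def step(self, action):
        self._update_state(action)
        reward = float(self.state)   # assumed reward, for illustration only
        done = self.state >= 3       # assumed termination rule
        # The agent observes the next state only through this return value.
        return self.state, reward, done


env = ToyEnv()
next_state, reward, done = env.step(1)
```

So from the agent's point of view, the observation time is the return of `step`, regardless of where inside `step` the internal update happens.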

Come on! Fix it! @zhumingpassional