Results 3 issues of RanW

Hi Aravind, I used your default setting. It looks like the objective has been maximized but the reward was not recovered.

## Describe the bug If the metric one is trying to log with the csv logger has `/` in its name, you will get a `No such file or directory`...

bug

Hi Jia Lian, I replicated your code in Pytorch and used the same number of hidden units and position only. My reward is also just a function of state not...