rl-plotter
Could you please add a way to handle rewards logged at the same step in multi-process training?
In my training process, I use multi-process PPO.
When I tried to draw the reward curve with rl-plotter, I found the following:
As the image shows, there are several rewards logged at the same step. But it seems that the curve shows only one point for them?
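For reference, here is a minimal sketch of one possible workaround: averaging the rewards that share a step before plotting. This is not part of rl-plotter's API. The function name `aggregate_by_step` and the `(step, reward)` record layout are assumptions made for illustration, and whether averaging is the right way to combine the workers' rewards depends on the experiment.

```python
from collections import defaultdict
from statistics import mean


def aggregate_by_step(records):
    """Average all rewards that share a step.

    `records` is an iterable of (step, reward) pairs, as might be collected
    from several worker processes (hypothetical layout, not rl-plotter's).
    Returns a list of (step, mean_reward) pairs sorted by step.
    """
    grouped = defaultdict(list)
    for step, reward in records:
        grouped[step].append(reward)
    return [(step, mean(rewards)) for step, rewards in sorted(grouped.items())]


# Made-up example data: three workers reporting at overlapping steps.
records = [(100, 1.0), (100, 3.0), (200, 2.0), (200, 4.0), (200, 6.0), (300, 5.0)]
print(aggregate_by_step(records))
# [(100, 2.0), (200, 4.0), (300, 5.0)]
```

After this preprocessing each step has exactly one value, so the plotted curve no longer depends on how duplicate steps are drawn.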
Hi, I have run into the same problem. Have you solved it?
No... I didn't manage to solve the problem, because the curve seems to look fine anyway.