Chimp icon indicating copy to clipboard operation
Chimp copied to clipboard

Simulator Visualization During Training

Open etotheipluspi opened this issue 9 years ago • 3 comments

Does it make sense for the visualization to be on while experience tuples are being generated?

We can just visualize during evaluation, since that will show how well the policy is doing during that training cycle.

etotheipluspi avatar Mar 03 '16 19:03 etotheipluspi

Yep, I think dynamic visualization of the (1) loss, (2) reward per episode, (3) Q-values avg. would be awesome. Any ideas what would be the best way to implement that?

yegortk avatar Mar 07 '16 01:03 yegortk

Dynamic plots of the loss etc would also be a great idea.

For this issue, I meant the simulator visualization, e.g. the Atari screen. I'm not sure if it makes sense to visualize during training, but maybe visualization during the evaluation might be useful?

etotheipluspi avatar Mar 07 '16 04:03 etotheipluspi

Ah, got it. In my experience, evaluation part passes too fast, so visualization appears for a very short time - training might take a pretty long time, and seeing the algorithm play helps mentally :)

yegortk avatar Mar 07 '16 04:03 yegortk