agents icon indicating copy to clipboard operation
agents copied to clipboard

Extracting additional variables from the environment?

Open JohnBurden opened this issue 4 years ago • 1 comments

Hi,

I have been adapting the DQN tutorial file for a.custom environment. It seems to learn fine, however I have an additional metric that I want to extract and plot that isn't reward. It corresponds to "safety" in this environment. I'm wondering if there is a standard way of extracting this from the evaluation stage. Do I need to extend the TimeStep class or can I do something else?

Thanks.

JohnBurden avatar Mar 25 '21 00:03 JohnBurden

Yes one option is to extend the TimeStep class, e.g. you can add an info field. Or you can add a property in the environment that stores the last value of safety every time you make a step.

kbanoop avatar Apr 06 '21 16:04 kbanoop