agents
agents copied to clipboard
Feature Request: Flag whether last action was random or not
I have a project where I would like to know whether the last action applied to the environment came from the agent's policy or from a random action (as a result of the epsilon greedy policy).
Is it possible to update the PolicyStep class to include an additional attribute to support this?