Unable to reproduce transfer results
I am trying to use this codebase to reproduce the failure of transfer with vanilla RL algorithms. However, I can not reproduce this phenomenon. It seems that this repo does not contain codes for saving trained models and the training accuracy is always zero on Breakout. I wonder if this is the original code the authors use. Also, I tried to use stable-baselines with wrappers in this repo, however, it seems that vanilla algorithms can successfully transfer to perturbed environments.
I wonder if there is something I missed since the results are contradictory to the paper.
I am facing the same issue.
I am trying to use this codebase to reproduce the failure of transfer with vanilla RL algorithms. However, I can not reproduce this phenomenon. It seems that this repo does not contain codes for saving trained models and the training accuracy is always zero on Breakout. I wonder if this is the original code the authors use. Also, I tried to use stable-baselines with wrappers in this repo, however, it seems that vanilla algorithms can successfully transfer to perturbed environments.
I wonder if there is something I missed since the results are contradictory to the paper.
I am facing the same issue. Can I make a contact with you? My email is [email protected]