vime
vime copied to clipboard
Code for the paper "Curiosity-driven Exploration in Deep Reinforcement Learning via Bayesian Neural Networks"
Could the algorithm be used on reinforced learning algorithms with experience reply?
Hi, Can I ask if anybody has reproduced the results of Halcheetahx experiment in the vime paper? Can anybody show me the hyperparameters you chose? Thanks!
If I understand right, run_trpo_expl.py is trpo + vime, so run_trpo is trpo w/o vime?
This is the figure I'm referring to:
[In discussion with Jose Miguel Hernandez-Lobato @jmhernandezlobato and Daniel Hernandez-Lobato @danielhernandezlobato] The current exploration objective used in the paper is a sum of expected reductions in entropy of the parameters...