David Yanguas Rojas comments



Fixed error in run_policy.py with Python 3.6

It is not hard to solve. I cast the lin_policy items to a list (around line 23) and then it worked:
```
lin_policy = np.load(args.expert_policy_file)
lin_policy = list(lin_policy.items())
lin_policy...
```
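For anyone who wants to see why the cast helps, here is a small standalone sketch. It does not use the real policy file: the archive name `demo_policy.npz`, the key `weights`, and the helper `first_array_from_npz` are all made up for illustration. On Python 3, `.items()` on a loaded `.npz` archive gives back a view that cannot be indexed, so wrapping it in `list(...)` makes positional access possible.

```python
import os
import tempfile

import numpy as np


def first_array_from_npz(path):
    """Return the first (name, array) pair stored in an .npz archive.

    Hypothetical helper, not part of run_policy.py.
    """
    with np.load(path) as archive:
        # On Python 3, archive.items() is a view and cannot be indexed
        # directly, so it is materialized into a list first.
        entries = list(archive.items())
        name, array = entries[0]
        return name, array


# Build a throwaway archive so the sketch runs without the real policy file.
tmp_dir = tempfile.mkdtemp()
demo_path = os.path.join(tmp_dir, "demo_policy.npz")
np.savez(demo_path, weights=np.arange(6.0).reshape(2, 3))

name, weights = first_array_from_npz(demo_path)
print(name, weights.shape)
```

The same pattern should apply to any archive written with `np.savez`; which entry the script actually needs depends on how the policy file was saved.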

About SHIFT

They explain that in the article... the idea is to suppress the survival bonus from the reward function in order to avoid some local optima. In Hopper the survival bonus...
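A rough sketch of the idea, assuming the shift is simply subtracted from every per-step reward before summing. The function name `shifted_return`, the reward values, and the bonus of 1.0 are illustrative assumptions, not values taken from the article or the repository.

```python
def shifted_return(rewards, shift):
    """Sum per-step rewards after subtracting a constant shift from each one."""
    return sum(r - shift for r in rewards)


# Hypothetical episode where the agent only stands still and collects an
# assumed survival bonus of 1.0 per step, with no other reward.
standing_rewards = [1.0, 1.0, 1.0, 1.0, 1.0]

# Hypothetical episode with some forward progress on top of the bonus.
moving_rewards = [1.5, 0.5, 1.25]

print(shifted_return(standing_rewards, 0.0))  # unshifted: standing still looks rewarding
print(shifted_return(standing_rewards, 1.0))  # shifted: standing still earns nothing
print(shifted_return(moving_rewards, 1.0))
```

With the shift applied, a policy that merely survives no longer accumulates return, which is one way to read the "avoid some local optima" remark.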