Results: 2 issues of never
Training was interrupted at epoch 189 on a server. The same config seems to run fine on my own computer. > Exception in thread Thread-4: > Traceback...
I ran ppo_run.py and got a .pkl file for HopperRandParamsEnv, with an average reward of about 200. But when I ran meta_test.py with the ProMP-trained policy, the average reward dropped...