ddpg
ddpg copied to clipboard
TensorFlow implementation of the DDPG algorithm from the paper Continuous Control with Deep Reinforcement Learning (ICLR 2016)
Hello~ I have some question about DDPG When my action dimension = 1, the result is good, but when my action dimension = 2 (the activation function is tanh and...
Hello~ I have some question about DDPG When my action dimension = 1, the result is good, but when my action dimension = 2 (the activation function is tanh and...
Hi, I have just tried running Reacher-v1 for 1000000 timesteps with default settings and it didn't learn anything (it just get stuck at -12 test reward), but it looks like...
Hi, Im running the code as-is for the InvertedPendulum-v1 environment. The output log looks like: ``` [2016-09-29 02:55:12,968] Making new env: InvertedDoublePendulum-v1 [2016-09-29 02:55:13,029] OpenGL_accelerate module loaded [2016-09-29 02:55:13,076] Using...