ddpg issues

DDPG Actor output saturate

Hello~ I have some question about DDPG Ｗhen my action dimension = 1, the result is good, but when my action dimension = 2 (the activation function is tanh and...

m5823779

Output of actor will saturate

Hello~ I have some question about DDPG Ｗhen my action dimension = 1, the result is good, but when my action dimension = 2 (the activation function is tanh and...

m5823779

Reacher-v1 not training

4

Hi, I have just tried running Reacher-v1 for 1000000 timesteps with default settings and it didn't learn anything (it just get stuck at -12 test reward), but it looks like...

amolchanov86

Unrealistic rewards for InvertedPendulum

1

Hi, Im running the code as-is for the InvertedPendulum-v1 environment. The output log looks like: ``` [2016-09-29 02:55:12,968] Making new env: InvertedDoublePendulum-v1 [2016-09-29 02:55:13,029] OpenGL_accelerate module loaded [2016-09-29 02:55:13,076] Using...

sahiliitm

ddpg
ddpg copied to clipboard

Metadata

DDPG Actor output saturate

Output of actor will saturate

Reacher-v1 not training

Unrealistic rewards for InvertedPendulum

← Metadata

Owner

Metadata

ddpg ddpg copied to clipboard

Metadata

DDPG Actor output saturate

Output of actor will saturate

Reacher-v1 not training

Unrealistic rewards for InvertedPendulum

← Metadata

Owner

Metadata

ddpg
ddpg copied to clipboard