DeepRL_Algorithms

Deep RL algorithm implementations in PyTorch and TensorFlow 2, written to be easy to read and understand (DQN, REINFORCE, VPG, A2C, TRPO, PPO, DDPG, TD3, SAC)

Results 8 DeepRL_Algorithms issues

Bumps [numpy](https://github.com/numpy/numpy) from 1.18.4 to 1.22.0. Release notes Sourced from numpy's releases. v1.22.0 NumPy 1.22.0 Release Notes NumPy 1.22.0 is a big release featuring the work of 153 contributors spread...

dependencies

Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 2.4.0 to 2.7.2. Release notes Sourced from tensorflow's releases. TensorFlow 2.7.2 Release 2.7.2 This releases introduces several vulnerability fixes: Fixes a code injection in saved_model_cli (CVE-2022-29216) Fixes...

dependencies

I get an error like this: AttributeError: 'MountainCarEnv' object has no attribute 'seed'. How can I solve it? Thank you!
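A likely cause, assuming a recent Gym install: Gym 0.26 and Gymnasium removed `env.seed()`, and the seed is now passed to `env.reset(seed=...)`, which returns `(observation, info)`. The helper below is only a sketch of a compatibility shim; `reset_with_seed` is a hypothetical name, not something from this repository, and where the repository calls `env.seed` is not shown here.

```python
def reset_with_seed(env, seed):
    """Seed and reset `env`, returning only the first observation.

    Hypothetical helper (not part of the repository). Assumes `env`
    follows either the older or the newer Gym reset/seed convention.
    """
    if callable(getattr(env, "seed", None)):
        # Older Gym API: seeding is a separate call, reset() returns obs only.
        env.seed(seed)
        return env.reset()
    # Newer API (Gym >= 0.26, Gymnasium): the seed goes to reset(),
    # which returns a (observation, info) pair.
    obs, _info = env.reset(seed=seed)
    return obs
```

With something like this, a call such as `env.seed(seed)` followed by `state = env.reset()` could be replaced by `state = reset_with_seed(env, seed)`. Pinning an older Gym version that still has `env.seed` would be another option.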

Bumps [tensorflow](https://github.com/tensorflow/tensorflow) from 2.4.0 to 2.11.1. Release notes Sourced from tensorflow's releases. TensorFlow 2.11.1 Release 2.11.1 Note: TensorFlow 2.10 was the last TensorFlow release that supported GPU on native-Windows. Starting...

dependencies

May I ask the author: after training the A2C algorithm, shouldn't the randomness be eliminated when testing?

May I ask the author why `def forward` is just `pass`? Shouldn't there be forward propagation?

May I ask the author: after training A2C, I tested with test.py, but the same unchanged state produced different actions. Why?
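One possible explanation for this and the earlier A2C question, assuming the policy is a categorical distribution that the test script samples from: a stochastic policy can return different actions for the same state, and evaluation is often made repeatable by taking the most probable action instead. The sketch below uses only the standard library; `select_action` and its `deterministic` flag are hypothetical names, not the repository's API.

```python
import math
import random


def softmax(logits):
    """Turn raw action scores into probabilities (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def select_action(logits, deterministic=False, rng=random):
    """Pick an action index from policy logits.

    deterministic=False: sample from the policy (usual during training).
    deterministic=True:  take the most probable action (repeatable at test time).
    """
    probs = softmax(logits)
    if deterministic:
        return max(range(len(probs)), key=probs.__getitem__)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

Calling `select_action(logits, deterministic=True)` repeatedly on the same logits always gives the same index, while the sampling path can vary from call to call. Whether greedy evaluation is appropriate depends on the task; some policies perform better when sampled.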

I built a maze environment myself, but I couldn't find results when I used the weight file at the end of training to test it. Do I need to train...