Policy-Gradient-Methods icon indicating copy to clipboard operation
Policy-Gradient-Methods copied to clipboard

Query on SAC2018.py file

Open sprakashdash opened this issue 5 years ago • 0 comments

Could you give reference to paper as to why you chose to make two soft-q networks because they are independently working and you are taking the minimum of both while calculating value-loss?

sprakashdash avatar Apr 01 '20 05:04 sprakashdash