alphadev
alphadev copied to clipboard
Applying last action twice when performing mcts
In alphadev.py:1031, isn't this applying the last action twice? The first time being on line 1025. Also, on line 1034 why don't we use legal_actions but instead expand on all action space?