garage icon indicating copy to clipboard operation
garage copied to clipboard

Rework logic for filling and checking replay buffer in torch sac, dog, and td3

Open avnishn opened this issue 5 years ago • 1 comments

Currently in sac, train once returns none if the replay buffer doesn't have the minimum number of timesteps in it.

This function should still return some value or raise an exception.

avnishn avatar Dec 09 '20 18:12 avnishn

_train_once is either private, or should be private, so I'm not sure what purpose this exception would have.

ryanjulian avatar Dec 09 '20 18:12 ryanjulian