Wei Xiong issues

Repositories
Issues
Comments

Results 1 issues of


                                            Wei Xiong

sign of the entropy

In the implementation of A2C, the code is policy_loss += self.entropy_weight * -log_prob # entropy maximization But I think since we are maximizing the entropy, in the loss, we shall...