Tianming Cui comments

Repositories
Issues
Comments

Results 1 comments of


                                            Tianming Cui

Can tf.agent policy return probability vector for all actions?

might be helpful: go to the greedy_policy.py, and find the function def _distribution(self, time_step, policy_state) Where we find that this function returns DeterministicWithLogProb(loc=greedy_action) where greedy_action = dist.mode(), and that is...