Summer Yue
Hmm, could you print out `env.action_spec()` for your environment?
Could you please elaborate? I see that the `l2_regularization_loss` includes the regularization losses and was added to total_loss. The coefficients default to 0, but if you wish to use...
Oh I see. Thanks for explaining. So you are pointing out the inconsistency in the implementation between PPO and other agents in terms of where losses are calculated, not that...
Sorry for the delayed response. We are not sure why this inconsistency exists; it's probably due to historical reasons. We could look into this once we get some bandwidth. Also feel...
Thank you for reporting. It's a little hard to know exactly what's going on. Could you help print out both `action_output_spec` and `action_spec` so we know why it doesn't match?
Thanks for providing the additional information! I think you're right. I was able to reproduce your issue in a simple example in Colab. I'll follow up here with a more...
Underneath you're using tf.train.CheckpointManager.save(). Your question is how to inspect a checkpointed file from TensorFlow. There have been some discussions about potentially using the `inspect_checkpoint.py` tool http://github.com/tensorflow/tensorflow/blob/master/tensorflow/python/tools/inspect_checkpoint.py (full disclosure I haven't...
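(Not a definitive recipe, just a minimal sketch: another way to peek inside a checkpoint is `tf.train.load_checkpoint`, which returns a reader you can query for variable names and shapes. The variable `my_var` and the temp directory here are made up for illustration.)

```python
import tempfile

import tensorflow as tf

# Save a tiny checkpoint so the example is self-contained.
ckpt_dir = tempfile.mkdtemp()
v = tf.Variable([1.0, 2.0], name="my_var")  # hypothetical variable
path = tf.train.Checkpoint(v=v).save(ckpt_dir + "/ckpt")

# Inspect it: list every saved variable with its shape.
reader = tf.train.load_checkpoint(path)
shape_map = reader.get_variable_to_shape_map()
for name, shape in shape_map.items():
    print(name, shape)
```

`reader.get_tensor(name)` can then pull out an individual tensor's value if you need more than shapes.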
Should your observation spec in your environment be TensorShape([960, 18]) instead of TensorShape([1, 960, 18])?
Sorry about the delay. Let me take a closer look at this later today.
The ValueError you're seeing indicates that the `time_step` you're passing into `policy.action` doesn't match the spec it expects. Could you try not including the additional dimension in...
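(For illustration only; the `[960, 18]` shape is taken from the spec mentioned earlier in this thread, and the arrays here are stand-ins for the real observation. The idea is simply that dropping the extra leading dimension makes the observation line up with the spec.)

```python
import numpy as np

spec_shape = (960, 18)  # shape the policy's time_step_spec expects (assumed)
obs = np.zeros((1, 960, 18), dtype=np.float32)  # observation carrying an extra leading dim

# Drop the extra dimension so the observation matches the spec.
obs_fixed = np.squeeze(obs, axis=0)
print(obs_fixed.shape)  # → (960, 18)
```

If the environment itself produces the extra dimension, the cleaner fix is to declare the observation spec without it, as suggested above.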