Ryan Julian
@krzentner I'm curious: what is your implementation plan for RNNs in torch/VPG? Is it to make the optimization trajectory-oriented (as in tf/VPG) or something else?
@JeongHyunho thank you for this issue report! This indeed looks like a bug, but I'd like to verify. Does `self._qf2` have the same parameters as `self._qf1`?
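One way to read the question above is "do the two Q-functions share the same parameter objects?". A minimal sketch of such a check follows, assuming only that each object exposes a `parameters()` method as `torch.nn.Module` does. The helper name `shares_parameters` and the `TinyModule` stand-in are hypothetical and are not part of garage.

```python
import copy


def shares_parameters(module_a, module_b):
    """Return True if any parameter object is shared between the two modules.

    Assumes both arguments expose a ``parameters()`` method that yields
    parameter objects, as ``torch.nn.Module`` does.
    """
    ids_a = {id(p) for p in module_a.parameters()}
    return any(id(p) in ids_a for p in module_b.parameters())


class TinyModule:
    """Hypothetical stand-in for a torch.nn.Module, used only for this demo."""

    def __init__(self, params):
        self._params = params

    def parameters(self):
        return iter(self._params)


qf1 = TinyModule([[0.1, 0.2], [0.3]])
qf2_alias = qf1                # same object, so parameters are shared
qf2_copy = copy.deepcopy(qf1)  # independent copy, so nothing is shared

print(shares_parameters(qf1, qf2_alias))
print(shares_parameters(qf1, qf2_copy))
```

This compares object identity, not values, so two independently initialized networks that happen to hold equal numbers would still count as not shared.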
`_train_once` is either private, or should be private, so I'm not sure what purpose this exception would have.
thanks @richardliaw !
These all look great to me and I look forward to reviewing the PRs! If you don't think they will all land around the same time, feel free to split...
If the CG optimizer can't be used with RNNs (I don't think that's actually the case), we should detect that and raise an error.
Can you add a little bit more explanation for the design here? I'm concerned about using an ADT as the blanket input to policies, which makes the interface pretty complicated...
@guoyijie you can send us a tensorboard link by using the https://tensorboard.dev service. Do you mind trying MT1-reach? That would be a more reliable indication of a possible problem.
What version of TensorFlow is installed? `python -c 'import tensorflow as tf; print(tf.__version__)'` within the Python environment you're running garage in will tell you. I'm guessing the problem may be...