Zifan Xu
Results
1
issues of
Zifan Xu
PPO training returns nan when using multiple GPU. Forcing t use one GPU works fine. I just ran the exactly same code in training code in [Brax Training](https://colab.research.google.com/github/google/brax/blob/main/notebooks/training.ipynb). Can somebody...