Zifan Xu

Results 1 issues of Zifan Xu

PPO training returns nan when using multiple GPU. Forcing t use one GPU works fine. I just ran the exactly same code in training code in [Brax Training](https://colab.research.google.com/github/google/brax/blob/main/notebooks/training.ipynb). Can somebody...