typoverflow
typoverflow
- This is for compatibility with newer versions of PyTorch, where `mode` is changed to be a class property of distributions. See https://github.com/pytorch/pytorch/pull/76690 for details.
Hi, I have one tiny question about the cuda kernel. In the code, `aa` and `bb` are running sums. To avoid overflow, you divided `exp(-p)` both when computing `y[ii]` and...
### Prerequisites - [X] I have tried updating UtilsRL to the newest version. - [X] I have checked both open and closed issues but found nothing related to my request....
### Prerequisites - [X] I have tried updating UtilsRL to the newest version. - [X] I have checked both open and closed issues but found nothing related to my request....
### Prerequisites - [X] I have tried updating UtilsRL to the newest version. - [X] I have checked both open and closed issues but found nothing related to my request....
### Prerequisites - [X] I have tried updating UtilsRL to the newest version. - [X] I have checked both open and closed issues but found nothing related to my request....
### Prerequisites - [X] I have tried updating UtilsRL to the newest version. - [X] I have checked both open and closed issues but found nothing related to my request....
Request to implement another Linear module, `ResetLinear`, which natively supports the Reset mechanism. It can accept parameters like `reset_interval`, which designates the step interval to reset the whole parameters of...
Implemented POR from *A Policy-Guided Imitation Approach for Offline Reinforcement Learning*