David Slayback
Hi, I'm a PhD student doing work in hierarchical reinforcement learning (specifically [Option-critic-based algorithms](https://arxiv.org/abs/1709.04571)), and I've found this repository to be a particularly helpful starting point when trying to prototype...
### 🐛 Describe the bug [File](https://gist.github.com/DavidSlayback/dd327d17d41e67267f0026efb0686fc5) In an RL workflow, when I use CUDA, JIT, and a LayerNorm together in this script, I get the following error ```python /home/david/Documents/GitHub/cleanoc/cleanrl/ppo_lngru_jit.py:281: UserWarning:...
I've been digging into Brax as a potential alternative to some modified dm_control environments I've been using and am really loving the speedup! That said, I feel like I've run...
## Description I've probably bitten off more than I can chew, but I'm trying to implement most of the algorithms in the "option-critic" family of hierarchical RL methods. ## Types...
### Code of Conduct - [X] I have read and agree to the GitHub Docs project's [Code of Conduct](https://github.com/github/docs/blob/main/.github/CODE_OF_CONDUCT.md) ### What article on docs.github.com is affected? [Using Filters](https://docs.github.com/en/actions/using-workflows/triggering-a-workflow#using-filters) ### What...
### What happened? Previously (under Ibis 7.2.0), we had defined a `SAFE_CAST` operation for our usage of Ibis with bigquery with the following: ```python import ibis import ibis.expr.datatypes as dt...
**Is your feature request related to a problem? Please describe.** It's a small issue, but in a repo that is attempting to transition from Pandas to Polars over time, there...
Forgive me for asking here, but I'm struggling to find relevant examples in the docs, and I'm not quite sure how to handle this. If I want to aggregate scalar...