torchrl icon indicating copy to clipboard operation
torchrl copied to clipboard

Highly Modular and Scalable Reinforcement Learning

Results 8 torchrl issues
Sort by recently updated
recently updated
newest added

Allow ONNX export of computational graphs so that they can be used in places beyond the repo.

priority/p2
area/feature

Paper: https://arxiv.org/abs/1801.01290 Blog Post: https://bair.berkeley.edu/blog/2018/12/14/sac/

area/algorithm
priority/p1
good first issue

area/algorithm
priority/p2
good first issue

Storing trajectories off-policy is helpful for algorithms which learn off policy. This may or may not be needed.

priority/p0
area/feature

Currently only `master` branch is deployed.

priority/p3
area/feature

Most new algorithms will require a pre-built tuning framework.

help wanted
area/algorithm
priority/p2

This can be achieved with `Horovod`. As long as we respect the MPI environment variables, it should be fairly simple (hopefully!) to port existing code to support distributed training.

priority/p2
area/feature

Current list of experiments satisfy a POC. Need to support experiments on more complicated environments to make sure future experiments can be done faster. As a first, could do this...

help wanted
area/algorithm
priority/p0