torchrl issues

This can be achieved with `Horovod`. As long as we respect the MPI environment variables, it should be fairly simple (hopefully!) to port existing code to support distributed training.
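A minimal sketch of what "respecting the MPI environment variables" could look like, assuming the job is launched with Open MPI's `mpirun`, which sets `OMPI_COMM_WORLD_RANK`, `OMPI_COMM_WORLD_SIZE` and `OMPI_COMM_WORLD_LOCAL_RANK` for each process. The helper names `mpi_world_info` and `seed_for_worker` are hypothetical and not part of torchrl. With Horovod itself, the same values would normally come from `hvd.rank()` and `hvd.size()` after `hvd.init()`.

```python
import os


def mpi_world_info(env=None):
    """Return (rank, world_size, local_rank) from Open MPI style variables.

    Falls back to a single-process layout (0, 1, 0) when the variables
    are absent, so the same script also runs without mpirun.
    """
    env = os.environ if env is None else env
    rank = int(env.get("OMPI_COMM_WORLD_RANK", 0))
    world_size = int(env.get("OMPI_COMM_WORLD_SIZE", 1))
    local_rank = int(env.get("OMPI_COMM_WORLD_LOCAL_RANK", 0))
    return rank, world_size, local_rank


def seed_for_worker(base_seed, env=None):
    """Hypothetical helper: give each worker a distinct seed.

    Offsetting by rank keeps workers from collecting identical rollouts.
    """
    rank, _, _ = mpi_world_info(env)
    return base_seed + rank


if __name__ == "__main__":
    rank, world_size, local_rank = mpi_world_info()
    print(f"worker {rank} of {world_size} (local rank {local_rank})")
```

Existing single-process code could then stay unchanged when launched on its own, and only branch on `rank` and `world_size` when started under `mpirun`.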

activatedgeek

priority/p2

area/feature

Reproduce experiments on harder Gym environments

The current list of experiments satisfies a POC. We need to support experiments on more complicated environments so that future experiments can be done faster. As a first, could do this...

activatedgeek

help wanted

area/algorithm

priority/p0


Support ONNX export of graphs

Reference Implementation of Soft-Actor Critic (SAC)

Reference Implementation for MCTS

Ability to record and store trajectories

Ability to save versioned docs

Hyperparameter Tuning

Support distributed training Out-of-the-Box

Reproduce experiments on harder Gym environments
