PPO-Pytorch

dai-dao

Implementation of PPO in Pytorch

PPO - PyTorch

This implementation is inspired by:

OpenAI TensorFlow code: https://github.com/openai/baselines/tree/master/baselines/ppo2
https://github.com/ikostrikov/pytorch-a2c-ppo-acktr
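
For orientation, the clipped surrogate objective at the core of PPO can be sketched as below. This is a framework-free illustration in plain Python, not code taken from this repository or from the implementations linked above; the function name `clipped_surrogate_loss` and the default `clip_eps=0.2` are assumptions chosen for the example.

```python
import math


def clipped_surrogate_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Return the mean PPO clipped surrogate loss (a value to minimize).

    All three inputs are equal-length sequences of floats, one entry per
    sampled action. `clip_eps` is an illustrative value, not a setting
    read from this repository.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio between the updated policy and the policy
        # that collected the data.
        ratio = math.exp(new_lp - old_lp)
        # Restrict the ratio to [1 - clip_eps, 1 + clip_eps].
        clipped_ratio = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the more pessimistic of the two surrogate terms.
        total += min(ratio * adv, clipped_ratio * adv)
    # Negate because the objective is maximized, while optimizers minimize.
    return -total / len(advantages)
```

In a PyTorch setting the same arithmetic would typically be written with tensor operations so that gradients flow through `new_log_probs`; the loop form here is only meant to make each step explicit.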

To run training:

python trainer_plus.py

Comparison between the OpenAI implementation and this implementation on the Atari game Breakout:

[Figure: Comparison]

Disclaimer

The PyTorch implementation is much cleaner and runs a bit faster in wall-clock time, yet still achieves comparable performance in the Breakout environment.

About

Implementation of PPO in Pytorch

Stars: 41
Forks: 2
Owner: dai-dao
