pytorch-policy-gradient-example icon indicating copy to clipboard operation
pytorch-policy-gradient-example copied to clipboard

A toy example of Policy Gradient implemented in Pytorch

pytorch-policy-gradient-example

Train an agent for CartPole-v0 using naive Policy Gradient.

Inspired by Andrej Karpathy's blog.

Code partly from Pytorch DQN Tutorial

Solved in 500 episodes (Avg Reward):

alt text