João Ribeiro

2 issues by João Ribeiro

Organized all code for the reinforcement learning reinforce example (matching the same change in #867). Tested and working as usual.

cla signed
reinforcement learning

Fixed a typo in the Config file.