Kyunghwan Kim

Results 6 issues of Kyunghwan Kim

Add category-based grouping when logging with wandb. - Agent - integration test
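A minimal sketch of how the grouping could be wired up, assuming runs are started through `wandb.init`, which accepts `project`, `group`, and `name` keyword arguments. The category strings, the helper names `build_wandb_kwargs` and `start_run`, and the project name in the usage note are hypothetical and not taken from the repository.

```python
from typing import Any, Dict

# Hypothetical category names, based on the two categories listed in the issue.
CATEGORIES = ("agent", "integration-test")


def build_wandb_kwargs(project: str, category: str, run_name: str) -> Dict[str, Any]:
    """Build keyword arguments for wandb.init so that runs are grouped by category."""
    if category not in CATEGORIES:
        raise ValueError(f"unknown category: {category!r}")
    return {"project": project, "group": category, "name": run_name}


def start_run(project: str, category: str, run_name: str):
    """Start a wandb run; wandb is imported lazily so the helper above works without it."""
    import wandb  # requires the wandb package and a configured account

    return wandb.init(**build_wandb_kwargs(project, category, run_name))
```

For example, `start_run("my-project", "agent", "a2c-seed0")` would place the run under the `agent` group in the wandb UI.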

![image](https://user-images.githubusercontent.com/17582508/87372118-c0123480-c5c1-11ea-937b-18124775924e.png) ![image](https://user-images.githubusercontent.com/17582508/87372164-d9b37c00-c5c1-11ea-9a03-af4732e80708.png)

I think this flag name is better than the previous one. - `--load-from` -> `--ckpt-path` If you have other ideas, please leave a comment below.
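One possible way to make the rename without breaking existing scripts, assuming the flag is parsed with `argparse`: register both spellings for the same destination. The parser description, the `dest` name, and the file name `model.pt` are illustrative assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    parser = argparse.ArgumentParser(description="hypothetical training entry point")
    # Both spellings write to the same destination, so old scripts keep working.
    parser.add_argument(
        "--ckpt-path",
        "--load-from",
        dest="ckpt_path",
        type=str,
        default=None,
        help="path to a checkpoint to load (--load-from is the old name)",
    )
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args(["--load-from", "model.pt"])
    print(args.ckpt_path)  # prints "model.pt"
```

Keeping the old name as an alias is a design choice; dropping it outright is simpler if no existing scripts depend on `--load-from`.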

minor issue

A2C is currently implemented for continuous environments such as LunarLanderContinuous. We should also implement A2C for discrete environments, because it may perform better there.
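The main change for discrete action spaces is the policy head: instead of sampling from a continuous distribution, the actor outputs one logit per action and samples from the resulting categorical distribution. The sketch below shows that step and the usual advantage-weighted actor loss in plain Python. It illustrates the general technique, not the repository's implementation, and all function names are hypothetical.

```python
import math
import random
from typing import List, Sequence, Tuple


def softmax(logits: Sequence[float]) -> List[float]:
    """Convert logits to probabilities, subtracting the max for numerical stability."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample_action(logits: Sequence[float], rng: random.Random) -> Tuple[int, float]:
    """Sample a discrete action and return it with its log-probability."""
    probs = softmax(logits)
    action = rng.choices(range(len(probs)), weights=probs, k=1)[0]
    return action, math.log(probs[action])


def actor_loss(log_prob: float, ret: float, value: float) -> float:
    """Policy-gradient loss for one step: -log pi(a|s) * (return - value estimate)."""
    advantage = ret - value
    return -log_prob * advantage


if __name__ == "__main__":
    rng = random.Random(0)
    action, log_prob = sample_action([0.1, 0.5, -0.3, 0.0], rng)
    print(action, actor_loss(log_prob, ret=1.0, value=0.4))
```

In a real agent the logits would come from the actor network and the loss would be computed on tensors so that gradients flow back to the network.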

We should switch from the MuJoCo env to the [Pybullet-gym](https://github.com/benelot/pybullet-gym) env because the MuJoCo license has expired. Pybullet-gym provides many continuous-action environments, including reacher and half-cheetah.