Fábio Ferreira

Results 22 comments of Fábio Ferreira

![screen shot 2018-08-30 at 19 52 15](https://user-images.githubusercontent.com/4440904/44869600-494e4580-ac8e-11e8-8d05-a56f622ec1ca.png) Same issue here. When I source virtual environments, 'system' turns into the env name but not for conda environments. How to fix this?...

Hi ajinkyaT, thank you for your feedback! Unfortunately, I'm currently a bit too busy to improve the package with the requested functionality but I'd like to invite you to implement...

Thanks @ruoshiliu. Yes, I saw the loss. But in addition to optimizing the loss you also need to use the intrinsic rewards (which is the result from optimizing its loss)...

By that I mean reward = extrinsic reward + intrinsic reward. From the paper: ![31B16992-8338-4C37-A1AE-6983E1EB9AF1](https://user-images.githubusercontent.com/4440904/109923743-852f4f00-7cbf-11eb-80fd-26fad356d414.jpeg) I now realize that the paper says the extrinsic reward can be optional. Wondering what...

here: https://github.com/p-christ/Deep-Reinforcement-Learning-Algorithms-with-PyTorch/blob/79fc69c1010f91795ca5319bc263f5c3442b0d25/agents/DQN_agents/Dueling_DDQN.py#L47

Apparently the way I use GaussianNoise doesn't seem to add noise to my input since the following unittest fails: ``` X, Y = self.get_samples(std=1.0) model_no_noise = KernelMixtureNetwork(n_centers=5, x_noise_std=None, y_noise_std=None) model_no_noise.fit(X,Y)...

@vishnumadhu365 thank you for your reply and code example. I tried that but get a ```` AssertionError: The optimizer should be given for training mode ```` since the teacher is...