Sahil Sharma comments

Results 9 comments of


                                            Sahil Sharma

Dropout gradient and adjustment factor

Hi Oogi, I am facing a similar issue. I have described it [here](https://groups.google.com/forum/#!topic/theano-users/ycmGFMDXhD0). Please let me know if you figured out the problem. Thanks

Training does not learn anything

Thanks, I'll try that. Also roughly how many `episode_count` or `max_steps` did you have to train for before you arrived at the saved `.h5` models ?

Training does not learn anything

Oh, thanks. Which tracks are simple? And which ones are more complicated? I usually run torcs in practice mode. Is that one of the simpler ones?

the training loss is normal?

Hi. I don;t experience this error. Which dataset are you running your experiments on?

the training loss is normal?

I used celebA as well. I use tensorflow 0.10 GPU version. I use pip install to install TF.

how to test the model?

@apeterswu The reason for drawing from probability distribution over discrete action space is two-fold: 1. During Training this helps in exploration. Suppose you think that action 1 is the best...

how to test the model?

@lforg37 Under the assumption that the observations could be aggregated to get a MDP-like state, my statement is true :) The case you mention is that of a POMDP. Typically...

how to test the model?

@lforg37 thanks for pointing out that case :) I hadn't considered it.

about steps related to the reward

Btw the code which @congling has used is not equivalent to resetting game on life lost. :p This is because all it says is that if you just lost a...