sdahan12
Thanks for this great code!
Hi, thank you for this implementation. I was wondering: I saw that your code uses 10 dimensions for MNIST, while the paper uses 50...
Is it normal that an epoch takes about 30 minutes with the default settings on a fairly strong GPU like a V100? You recommended 500 epochs, which would be more than 10 days...
Hi, in part 2 there is a calculation of dzdx_empirical. Can someone help explain the calculation? Why is there a sum, and why is there a division by...
Hi, can you share the param_file, in case I just want to use the network on my data without training?