magic282 comments

Results 8 comments of


                                            magic282

Machine Translation Save and Load model problem

+1. I simply replace '-' back to '/'. It can work, but the perp seems to be wrong after resuming training. I guess that there may be something wrong with...

Machine Translation Save and Load model problem

@rizar I tried to replace the checkpoint code with the latest code in saveload.py. It can be loaded, but it seems the state or something is messed. INFO:blocks.algorithms:Initializing the training...

Machine Translation Save and Load model problem

@orhanf I guess so. I was using AdaGrad. So will the dump contain the adaptive algorithms' accumulators?

Machine Translation Save and Load model problem

@Thrandis I tried retraining with blocks 0.1.1 and 0.2.0 release, and found both of them have this problem. (I didn't load a model saved by other blocks code). @rizar I...

Too old versions

It seems that recently mxnet 0.9.4 fixed a bug and can get better memory perfomance for bucket models, especially for RNN.

Too old versions

Great! Also, I am very curious about pytorch and torch. Are their performance comparable or not?

inference on spark with scala

Feel free to do what you like. The next branch does not have the inference part. The master branch does not work with the lastest mxnet.

multi gpu error

mxnet slices the batch to do a data parallelization which causes this error.