(François) Hoa T. Le

Results 2 issues of (François) Hoa T. Le

Hi, It seems that Perplexity is normalized twice & norm_term of NLLLoss should be masked out as well.

bug
medium priority

Hi, I see that this implementation is lacking masked attention on encoder. Input_lengths should be passed to decoder (not just encoder) in order to compute this. OpenNMT already provided this...

medium priority