(François) Hoa T. Le
Results
2
issues of
(François) Hoa T. Le
Hi, It seems that Perplexity is normalized twice & norm_term of NLLLoss should be masked out as well.
bug
medium priority
Hi, I see that this implementation is lacking masked attention on encoder. Input_lengths should be passed to decoder (not just encoder) in order to compute this. OpenNMT already provided this...
medium priority