allen
Agree with @rahul-iisc: at test time, decoder_inputs is the same as the encoder inputs. Running main.py, we get test precision of about 63% (Step: 99900). At training time we have two options:...
Might there be some mistakes? Even using the basic Hugging Face MLM pipeline to continue pretraining deberta-v3-large works. Tested on the Kaggle NBME dataset.
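For reference, here is a minimal sketch of what continued MLM pretraining with the standard transformers pipeline can look like. The corpus path (`nbme_notes.txt`), hyperparameters, and output directory are placeholders I'm assuming for illustration, not the exact setup used above.

```python
# Minimal sketch: continued MLM pretraining of deberta-v3-large with transformers.
# The data file, hyperparameters, and output dir below are hypothetical placeholders.
from datasets import load_dataset
from transformers import (
    AutoTokenizer,
    AutoModelForMaskedLM,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)

model_name = "microsoft/deberta-v3-large"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForMaskedLM.from_pretrained(model_name)

# Plain-text corpus, one document per line (hypothetical path).
raw = load_dataset("text", data_files={"train": "nbme_notes.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=512)

tokenized = raw.map(tokenize, batched=True, remove_columns=["text"])

# Dynamic masking: 15% of tokens are masked at collation time.
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm_probability=0.15)

args = TrainingArguments(
    output_dir="deberta-v3-large-nbme-mlm",
    per_device_train_batch_size=8,
    num_train_epochs=2,
    learning_rate=2e-5,
    fp16=True,
    save_strategy="epoch",
)

trainer = Trainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    data_collator=collator,
)
trainer.train()
```

The resulting checkpoint can then be loaded with `AutoModel.from_pretrained("deberta-v3-large-nbme-mlm")` for downstream fine-tuning.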
For those looking for the code, you can refer to this nice work: https://github.com/abisee/pointer-generator. I will implement new pointer-generator-related work based on the tf 1.2 attention_wrapper, that is, dynamic...