Dupe factor in create_pretraining_data.py
The documentation for dupe_factor says: "Number of times to duplicate the input data (with different masks)." Did the original BERT pre-training actually use duplicated data? Intuitively the most obvious value would be 1, i.e. no duplication at all. Let me know whether setting dupe_factor to something like 5 or 10 is beneficial and does not make the model overfit.
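
If I read the script correctly, it loops over the corpus dupe_factor times and draws a new random mask on each pass, so the written pre-training examples contain dupe_factor differently-masked copies of each document (the flag appears to default to 10 in the repo). Below is a minimal sketch of that idea, not the original implementation: mask_tokens here is a simplified stand-in for the script's create_masked_lm_predictions and ignores whole-word masking, random-token replacement, and max_predictions_per_seq.

```python
import random

def mask_tokens(tokens, masked_lm_prob=0.15, rng=None):
    """Return a copy of `tokens` with roughly masked_lm_prob of positions
    replaced by [MASK]. Simplified: the real script also keeps some chosen
    tokens unchanged or swaps in random tokens."""
    rng = rng or random.Random()
    output = list(tokens)
    # [CLS] and [SEP] are never masked.
    candidates = [i for i, t in enumerate(tokens) if t not in ("[CLS]", "[SEP]")]
    rng.shuffle(candidates)
    num_to_mask = max(1, int(round(len(candidates) * masked_lm_prob)))
    for i in candidates[:num_to_mask]:
        output[i] = "[MASK]"
    return output

tokens = "[CLS] the quick brown fox jumps over the lazy dog [SEP]".split()
rng = random.Random(12345)
dupe_factor = 3  # illustrative; the original flag default seems to be 10

# Each duplicate is the same sentence with a different random mask,
# so the model (almost) never sees the identical masked example twice.
for _ in range(dupe_factor):
    print(" ".join(mask_tokens(tokens, rng=rng)))
```

So my question is really whether these differently-masked duplicates act as useful data augmentation for the masked-LM objective, or whether repeating the same underlying text dupe_factor times just encourages memorization.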