CodeRL icon indicating copy to clipboard operation
CodeRL copied to clipboard

Question about pre-training process

Open natedingyifeng opened this issue 3 years ago • 1 comments

Hi, I am curious about what validation task you are using during the pre-training process. Could you please share some information about this issue?

natedingyifeng avatar Jul 28 '22 15:07 natedingyifeng

Hi, for each stage (either MSP or NTP task) of pretraining, we employ a small proprotion of training data as the held-out validation set and monitor the corresponding loss (either MSP or NTP loss) on this subset. We stop the pretraining when it converges (or in other words, the validation loss does not decrease).

yuewang-cuhk avatar Aug 10 '22 09:08 yuewang-cuhk