Minjia Zhang
Results: 1 issue of Minjia Zhang
There appears to be a bug in our DeepSpeed SQuAD fine-tuning code: the model configuration file contains duplicate keys for the dropout probability settings. With the bug, it is...
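Duplicate keys of this kind are easy to miss, because Python's `json` module accepts them silently and keeps only the last value. The following is a minimal sketch of how such duplicates could be detected, assuming the configuration file is JSON. The helper name `find_duplicate_keys` and the sample key names and values are hypothetical, chosen for illustration; they are not taken from the actual DeepSpeed configuration.

```python
import json
from collections import Counter


def find_duplicate_keys(json_text):
    """Return keys that appear more than once within the same JSON object."""
    duplicates = []

    def check_pairs(pairs):
        # `pairs` holds every (key, value) of one object, duplicates included.
        counts = Counter(key for key, _ in pairs)
        duplicates.extend(key for key, n in counts.items() if n > 1)
        return dict(pairs)

    json.loads(json_text, object_pairs_hook=check_pairs)
    return duplicates


# Hypothetical configuration fragment with one repeated dropout key.
sample_config = """
{
  "hidden_dropout_prob": 0.1,
  "attention_probs_dropout_prob": 0.1,
  "hidden_dropout_prob": 0.0
}
"""

print(find_duplicate_keys(sample_config))                 # ['hidden_dropout_prob']
print(json.loads(sample_config)["hidden_dropout_prob"])   # 0.0 (last value wins)
```

Running a check like this over a configuration file before training would surface the repeated key instead of letting the later value quietly override the earlier one.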