toke1220
Results
1
comments of
toke1220
> Did you set smaller learning rate? Since you train the model on one GPU, the batchsize is only 1/8 of the original. Thus, the original learning is too large...