Lei Wang

Results 2 issues of Lei Wang

Hi, when I am working on this model, the training loss is 'nan' since the beginning. Step: 0, Train loss: nan, Time: 9.87615s Step: 1, Train loss: nan, Time: 1.97912s...