Yassien Shaalan

Results 1 comments of Yassien Shaalan

Thanks I will look into the evaluation part, however, there still some correlation between the gradient explosion (causing the nan loss) and the number of batches per epoch (may be...