Language-Modeling-GatedCNN
Language-Modeling-GatedCNN copied to clipboard
Could you please provide the log for training with the default configuration?
I train the model with the provided dataset and configuration. And the initial loss is huge, e.g. 8~9e+10. I think there is something wrong with my training. Could you please provide the log for training. Thank you!
Me too, have you found any solution? @Jar7
I have the same problem loss exaggerate to 4389657512261976064.00 rapidly.
Use Adam optimizer will get a lower loss but still higher than normal