Wu Bowen
Thank you for your reply and your clear explanation. I have personally found that the gradient sometimes explodes, causing the network to output NaN, if rescaling is not properly applied (e.g.,...