Wu Bowen

Results 1 comments of Wu Bowen

Thank you for your reply and your clear explanation. I personally found that gradient will sometimes explode, causing the network to output nan, if rescaling is not properly applied (e.g.,...