Easy trick to avoid overflow in softmax

Open timvieira opened this issue 7 years ago • 1 comments

I see that you are using an "unsafe" softmax distribution on line 308 of parser.py. This line is likely to overflow if scores * alpha is large. There is a simple trick to make overflow in softmax impossible.

import numpy as np

def softmax(scores, alpha):
    x = scores * alpha
    x -= x.max()
    np.exp(x, out=x)
    x /= x.sum()
    return x

print softmax(np.random.uniform(-300, 300, 10), 10)

This code is lightly optimized to reuse temporary arrays.

Apr 28 '18 00:04 timvieira

I'm aware of this issue, but did not found this code ran into that problem in practice. Would be happy to accept a pull request.

Apr 30 '18 22:04 jhcross