In softmax equation 4.1, the book says:
Consider what happens when all the xi are equal to some constantc. Analytically,we can see that all the outputs should be equal to1/n. Numerically, this may not occur when c has large magnitude.
Whats the reason for this?
Read more… (45 words)