How did we arrive at the sigmoid function for calculating probabilities?
Why not use some other function that " squashes " the values to lie between [0, 1]. Maybe even just normalise the values so they all add up to one.
How did we arrive at the sigmoid function for calculating probabilities?
Why not use some other function that " squashes " the values to lie between [0, 1]. Maybe even just normalise the values so they all add up to one.
I think a really nice explanation for the popularity of the sigmoid function is in these lecture notes (http://www.stat.cmu.edu/~cshalizi/uADA/12/lectures/ch12.pdf)
The problem is that logarithms are unbounded in only one direction, and linear functions are not.
Aren't logarithms actually unbounded in both directions?
– pX0r Jun 08 '18 at 06:42