We know that
$$\left( \begin{array}{cccc} 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \\ \end{array} \right)^2=\left( \begin{array}{cccc} 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \\ 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 \\ \end{array} \right)$$
Why cannot we say that the square root of the later matrix is the former one? Can we somehow consistently define square root from nilpotent matrices?
I know that the equation $X^2=A$ would probably have more than one solution but why we cannot define the "main branch" exactly this way?