I'm reading about optimization problems involving mixed norms. In particular I'm getting acquainted with the $\ell_{2,1}$ norm. For a matrix $\mathbf{X}$, the $\ell_{\alpha,\beta}$ norm, $\|\mathbf{A}\|_{\alpha,\beta}$ is defined as,
$$\|\mathbf{A}\|_{\alpha,\beta} = \big( \sum_i \|\mathbf{A}_i\|_\alpha^\beta \big)^{\frac{1}{\beta}}$$
where, $\mathbf{A}_i$ is the $i^{th}$ column of $\mathbf{A}$. According to this definition, $\mathbf{A}_{2,1}$ can be written as,
$$\|\mathbf{A}\|_{2,1} = \sum_i \|\mathbf{A}_i\|_2$$
Now my question is, why is the $\ell_{2,1}$ norm non-smooth? As far as I understand, smoothness of a function is related to continuous differentiability. Is there any reason that even if the $\ell_2$ norm is differentiable a sum of $\ell_2$ norms is non-differentiable. So in short, my question is, why $\ell_{2,1}$ norm is non-smooth? Can it be proved to be a non-smooth function? Will appreciate your help in understanding these concepts.