Jensen's inequality states that if $(X,\mu)$ is a measure space with $\mu(X) = 1$, $\phi$ is convex, and $f:X \rightarrow \mathbb R$ is integrable, then
$$\phi\left(\int fd\mu\right) \leq \int \phi \circ fd\mu.$$ I am trying to find an alternative proof which doesn't require the fact that convex functions are differentiable. I first prove the result for simple functions: if $f = \sum_{i=1} ^n a_i \chi_{A_i} $ and the $A_i$ to be disjoint with $X = \bigsqcup_{i=1} ^n A_i$. Then $\sum_{i=1} ^n \mu(A_i) = 1$, so
$$\phi\left(\int_X \sum_{i=1} ^n a_i \chi_{A_i} d\mu\right) = \phi\left(\sum_{i=1} ^n a_i \mu(A_i ) \right) \leq \sum_{i=1} ^n \phi(a_i) \mu(A_i ) = \int_X \phi \circ f d\mu .$$
I want to extend this to the general case by approximating $f$ with a sequence of simple functions $(f_n)$. However, I run into a problem showing that the integrals of $\phi \circ f_n$ converge to the integral of $\phi \circ f$. Any suggestions?