Here is the simplest proof I know. It is a very straightforward application of the calculus of finite differences.
Let $f(x)$ be a polynomial. Consider its backward finite difference
$$(\Delta f)(x) = f(x) - f(x - 1).$$
Key lemma: If $f$ has leading term $a_n x^n$, then $\Delta f$ has leading term $n a_n x^{n-1}$. In particular, its degree is one less than that of $f$.
Sketch. Just expand out $f(x) - f(x - 1)$ using the binomial theorem. Notice that this is exactly how the derivative behaves, which is one reason it's nice: it's quite easy to remember. $\Box$
Now, write $\Delta^n f$ for the $n$-fold finite difference of $f$.
Corollary: If $f$ has leading term $x^n$, then $\Delta^n f = n!$.
On the other hand, what is $\Delta^n f$ more explicitly? Write $S$ for the operator which, given a function $f(x)$, returns the function $f(x - 1)$, and write $I$ for the identity operator which, given a function $f(x)$, returns the function $f(x)$ again. Then the backward finite difference can be written
$$\Delta f = (I - S) f$$
and hence, by the binomial theorem,
$$\Delta^n f = (I - S)^n f = \sum_{k=0}^n {n \choose k} (-1)^k S^k f$$
or, more explicitly,
$$(\Delta^n f)(x) = \sum_{k=0}^n {n \choose k} (-1)^k f(x - k).$$
You can also prove this identity by induction or using generating functions. However you want to prove it, once you combine it with the corollary above, setting $f(x) = x^n$ gives exactly the desired result.