If $f : E \to F$ is a continuous map of Banach spaces, with bounded Fréchet derivative. Then
$x_0,x_1 \in E\Rightarrow \|f(x_1) − f(x_0)\| ≤ M\|x_1 − x_0\|$ where $M = \sup \|f'(x)\|.$
The most efficient way to prove this, as far as I know, is to apply Hahn-Banach. Alternatively, one starts with $g(t)= f(x_0 + t(x_1 − x_0));\ 0\le t\le 1$ and reduces the problem to the real case. But, of course, the Hahn-Banach theorem is a (relatively) big gun, and to do the integral if one uses $g$, then integration in Banach spaces must be dealt with (via regulated functions, for example.)
The only elementary proof I have seen uses the dot product (Apostol, Rudin), but then of course, one must assume an inner product.
Below I sketch a very basic proof, but it required more effort than I expected, so my question is: can one get it cheaper, using only elementary means (very basic facts about normed spaces and the definition of derivative)?
Assume for convenience that $x_0=f(x_0)=0$ and consider the segment $\{tx_1:\ 0\le t\le 1\}.$
We have $\|f(x)\|\le M\|x\|+\|r(x)\|\cdot\|x\|$ where $\frac{\|r(x)\|}{\|x\|}\to 0$ as $x\to 0.$
So, if $\epsilon>0,$ we can choose $\delta>0$ such that $t\le \min \left(1,\frac{\delta}{\|x_1\|}\right)\Rightarrow \|f(tx_1)\| ≤ (M + \epsilon)t\|x_1\|$. To prove the claim, it will suffice to prove that $t=1$ satisfies this last inequality, so to that end,
let $\tau=\sup\{r>0:\|f(tx_1)\| ≤ (M + \epsilon)t\|x_1\|\text{for all}\ t\in [0,r]\}$, so that $\tau\le 1$ and $\tau\in \{r>0:\|f(tx_1)\| ≤ (M + \epsilon)t\|x_1\|\text{for all}\ t\in [0,r]\}$ (because $f$ is continuous.)
Now, toward a contradiction, suppose that $\tau<1$ and choose $c>0$ and small enough so that $(\tau+c)x_1$ is on the segment.
Now, there is a $\delta' > 0$ such that
$\|f(x) − f(\tau x_1)\| \le (M + \epsilon)||x − \tau x_1\|<\epsilon$ whenever $\|x − \tau x_1\| <\delta'.$
So, since $\|(\tau + c)x_1 − \tau x_1\| = c\|x_1\|<\delta' $ if $c$ is small enough, we have, with $x=(\tau + c)x_1$,
$\|f((\tau+c) x_1)-f(\tau x_1)\| \le (M+\epsilon)\|(\tau+c)x_1-\tau x_1\|=(M+\epsilon)\|cx_1\|$
and so finally,
$f((\tau+c)x_1)\le (M+\epsilon)\|cx_1\|+\|f(\tau x_1)\|\le $
$(M+\epsilon)\|cx_1\|+ (M+\epsilon)\|\tau x_1\|=(M+\epsilon)(c+\tau)\|x_1\|$,
which contradicts the fact that $\tau$ is a supremum.