I have some questions about the following theorem and it's proof.
Theorem. Let $X\subset \mathbb R^n$, open, convex, bounded and $f_n:X \to \mathbb R^m$ differentiable.And let's consider the following:
- There exist $H$: $\mathbb R^n\rightarrow L(\mathbb R^n,\mathbb R^m)$ such that $Df_n \rightarrow H$ uniformly in X, where $Df_n$ is the differential of $f_n$.
- There exist $x_0 \in $ X such that the sequence $\{f_n(x_0)\}$ converges.
Then there exist $f:X \rightarrow \mathbb R^m$ such that $f_n \rightarrow f$ uniformly over X, $f$ is differentiable and $Df(x)=H(x)$, for all $x\in X$.
Proof. Notice that \begin{align*} & \frac{f(x)-f(x_{0})-H(x_{0})\cdot(x-x_{0})}{\Vert x-x_{0\Vert}}% \color{fuchsia}=\frac{f(x)-f(x_{0})-[f_{n}(x)-f_{n}(x_{0})]}{\Vert x-x_{0\Vert}}\\ & +\frac{f_{n}(x)-f_{n}(x_{0})-\nabla f_{n}(x_{0})\cdot(x-x_{0})}{\Vert x-x_{0\Vert}}+\frac{(\nabla f_{n}(x_{0})-H(x_{0}))\cdot(x-x_{0})}{\Vert x-x_{0\Vert}}\\ & =:I+II+III. \end{align*} Since $X$ is convex, by applying the mean value theorem to the function $$ g_{n,m}(t)=f_{m}(tx+(1-t)x_{0})-f_{n}(tx+(1-t)x_{0}),\quad t\in\lbrack0,1] $$ $\color{fuchsia}{**...(2)**}$
there is $t_{0}$ such that \begin{align*} & f_{m}(x)-f_{m}(x_{0})-[f_{n}(x)-f_{n}(x_{0})]=g_{n,m}(1)-g_{n,m}(0)\\ & =g_{n,m}^{\prime}(t_{0})=(\nabla f_{m}(z_{0})-\nabla f_{n}(z_{0}% ))\cdot(x-x_{0}), \end{align*} where $z_{0}=t_{0}x+(1-t_{0})x_{0}$. By uniform convergence of the gradients, ...(1) $$ \Vert\nabla f_{m}(z)-\nabla f_{n}(z)\Vert\leq\Vert\nabla f_{m}(z)-H(z)\Vert +\Vert\nabla f_{n}(z)-H(z)\Vert\leq2\varepsilon $$ for all $n,m\geq n_{\varepsilon}$ and all $\color{fuchsia}z\in X$. Hence, by Cauchy's inequality \begin{align*} \left\vert \frac{f_{m}(x)-f_{m}(x_{0})-[f_{n}(x)-f_{n}(x_{0})]}{\Vert x-x_{0\Vert}}\right\vert & =\left\vert \frac{(\nabla f_{m}(z_{0})-\nabla f_{n}(z_{0}))\cdot(x-x_{0})}{\Vert x-x_{0\Vert}}\right\vert \\ & \leq\Vert\nabla f_{m}(z_{0})-\nabla f_{n}(z_{0})\Vert\leq2\varepsilon. \end{align*} Since $X$ is bounded, this inequality implies that \begin{align*} \vert f_{m}(x)-f_{n}(x)\vert\le|f_{m}(x_{0})-f_{n}(x_{0})|+\Vert x-x_{0}\Vert 2\varepsilon\le |f_{m}(x_{0})-f_{n}(x_{0})|+2M\varepsilon. \end{align*} and so $\{f_n\}$ is a uniform Cauchy sequence ...(2) and so it converges uniformly to a function $f$. Letting $m\rightarrow\infty$ $\color{fuchsia}{**...(3)**}$ we get $$ \left\vert \frac{f(x)-f(x_{0})-[f_{n}(x)-f_{n}(x_{0})]}{\Vert x-x_{0\Vert}% }\right\vert \leq2\varepsilon $$ for all $n \geq n_{\varepsilon}$. This takes care of $I$. Taking $n =n_{\varepsilon}\color{fuchsia}{**...(4)**} $ and using the fact that $f_{n_{\varepsilon}}$ is differentiable at $x_{0}$ we get that $$ \left\vert \frac{f_{n_{\varepsilon}}(x)-f_{n_{\varepsilon}}(x_{0})-\nabla f_{n_{\varepsilon}}(x_{0})\cdot(x-x_{0})}{\Vert x-x_{0\Vert}}\right\vert \leq\varepsilon $$ for all $x\in X$ with $0<\Vert x-x_{0}\Vert\leq\delta_{\varepsilon}$. This takes care of $II$.
Lastly, by Cauchy's inequality $$ \left\vert \frac{(\nabla f_{n}(x_{0})-H(x_{0}))\cdot(x-x_{0})}{\Vert x-x_{0\Vert}}\right\vert \leq\Vert\nabla f_{n}(x_{0})-H(x_{0})\Vert \leq\varepsilon $$ for all $n\geq n_{\varepsilon}$. In conclusion we have that for all $x\in A$ with $0<\Vert x-x_{0}\Vert\leq\delta_{\varepsilon}$, $$ \left\vert \frac{f(x)-f(x_{0})-H(x_{0})\cdot(x-x_{0})}{\Vert x-x_{0\Vert}% }\right\vert \leq4\varepsilon $$ which implies that $f$ is differentiable at $x_{0}$ with $\nabla f(x_{0})=H(x_{0})$.$\color{fuchsia}{**...(5)**}$ By repeating the proof with $x_0$ replaced by any other point, we get that $f$ is differentiable in $X$. ...(4)
My questions are in $\color{fuchsia}{pink}$ color.
- At the beginning why it is that $H(x_0) (x-x_0)= f_{n}(x)-f_{n}(x_{0})$?
- How is $g_{m,n}$ defined? I think its domain is $[0,1]$ but I don't know which codomain has.
- What is the form of $z$, is it a simple vector in X, or has the form of $z_0$?
- Why do we take $m\to\infty$? Could have been $n$? And why taking $m\to\infty$ implies the next inequality?
- Why do we take the $n=n_{\epsilon}$? I really don't see this step.
- Why do we need to repeat the proof?? Isn't $x_0$ arbitrary?
- How can the proof be formal? At the beginning, it is stated that it must be always $\varepsilon>0$, and maybe the $x,x_0\in X$ must be given too? Or they must be given in the middle of the proof? Or where should they be? Or how?