If the following two assumptions are provided:
A. $\displaystyle \lim_{x \to a^+} f'(x)=L$
B. $f$ is continuous at $a$,
could someone please explain how exactly this resulting statement is constructed: $$\lim_{x \to a^+}f'(x)=\lim_{x \to a^+}f'(\eta_x)$$, where $\eta_x$ is described as the value in $(a,x)$ that is provided by the Mean Value Theorem with the following property: $f'(\eta_x)=\frac{f(x)-f(a)}{x-a}$.
I have an attempt at explaining this claim, but I believe it requires the axiom of choice...so I am unsure if it is correct / the best way of formally explaining this result.
Attempted Explanation
The assumption that $\displaystyle \lim_{x \to a^+}f'(x)=L$ implicitly tells us that there is some $b \gt a$ such that $f'$ is defined on the interval $(a,b]$.
The assumption that $f$ is continuous at $a$ let's us invoke the Mean Value Theorem on the interval $[a,b]$ and any subinterval of the form $[a,x]$.
By the Mean Value Theorem, we know that for any $x \in (a,b]$, there is an $\eta$ such that $f'(\eta)=\frac{f(x)-f(a)}{x-a}$ and $\eta \in (a,x)$
From 3., we can define the following function $M$ whose domain is $(a,b]$ and whose mapping rule is: $M(x)=\eta$ such that $f'(\eta)=\frac{f(x)-f(a)}{x-a}$ and $\eta \in (a,x)$. $\color{red}{\text{NOTE}}$ - I believe this function definition requires one to rely on the Axiom of Choice, because it may be the case that there are an infinite number of different values that can satisfy the relevant properties for a given $x$ input. The easiest example I can think of is a linear equation.
With this definition, we can now prove that $\displaystyle \lim_{x \to a^+}f'(x)=L \rightarrow \lim_{x \to a^+}f'\circ M(x)=L$ (which is quick and painless)
Consider an arbitrary $\varepsilon \gt 0$ and the corresponding $\delta_{\varepsilon}$ given to us by assumption. We know that for any $x \in (a,a+\delta_{\varepsilon}):|f'(x)-L|\lt \varepsilon$. Choose any $x \in (a,a+\delta_{\varepsilon})$. Let $\delta_1=x-a$. Then, for all $z \in (a,a+\delta_1):M(z) \in (a,a+\delta_1) \subset (a,a+\delta_{\varepsilon})$, which implies that for any $z \in (a,a+\delta_1): |f'\left(M(z)\right)-L| \lt \varepsilon$.
This is what is meant by $\displaystyle \lim_{x \to a^+}f'(x)=\lim_{x \to a^+}f'(\eta_x)$.
Edit: This post is meant to specifically look at the equality $\displaystyle \lim_{x \to a^+}f'(x)=\lim_{x \to a^+}f'(\eta_x)$...as it is through this equality that we arrive at the conclusion: $\displaystyle \lim_{x \to a^+}f'(x)=f'^+(a)$ because $\lim_{x \to a^+}f'(\eta_x)=\lim_{x \to a^+}\frac{f(x)-f(a)}{x-a}:=f'^+(a)$