If you really want to understand this, perhaps the only way is to go under the hood and see how a change of variables (COV) really works. A COV is not the fact that
$$\int{U(x) dx } \,=\, \int{U(t) dt } \,=\, \int{U(i) di }.$$
This is merely notation; it is picking a letter.
Let's work with $f(x) \,=\, (4-x^2)^{-1/2}$ defined on $(-2,2)$. Because this function is continuous, it is the derivative of something. So we can talk about $F(x)$ defined on $(-2,2)$ with derivative $f(x)$.
Now let's look at our third function, $x(\theta) \,=\,2\sin \theta.$ We must note a few facts related to it.
- it is continuous on the interval $[-\pi / 2,\pi / 2]$.
- it is differentiable on the interval $(-\pi / 2,\pi / 2)$. (so we can have our fourth function $x'(\theta)$.)
- $x'(\theta)$ is strictly greater than zero inside its domain $(-\pi / 2,\pi / 2)$.
- we have a neat matchup of endpoints: $x(-\pi / 2) = -2, x(\pi / 2) = 2$.
All this implies that $x(\theta)$ has an inverse function $\theta(x)$. In other words, the $x$'s are in one-to-one correspondence with the $\theta$'s. The upshot is that, although the obvious thing to write is
$$\int{f(x) dx } \,=\, F(x),$$
we are equally justified in taking the point of view
$$\int{f(x) dx } \,=\, \Phi(\theta(x)).$$
(Here $\Phi(\theta) = F(x(\theta))$; that is, $\Phi$ performs the action: apply $F$ to the $x$ which corresponds to $\theta$.) Now perhaps you begin to see why a COV is more than just a notational thing, and that this curious change in point of view is the key to making the substitution work. We also need the chain rule:
$$\frac{d}{d \theta} \Phi(\theta) \,\,=\,\, \frac{d}{d \theta} F(x(\theta)) \,\,=\,\, f(x(\theta))\,x'(\theta).$$
The third expression is therefore the derivative of $\Phi$ on $(-\pi / 2,\pi / 2)$, so its integral with respect to $\theta$ is $\Phi(\theta)$. As your wiki page shows, $\Phi(\theta) = \theta$ (up to an additive constant).
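In case it helps, here is a sketch of the computation behind that last claim, using only the definitions of $f$ and $x(\theta)$ given above:
$$f(x(\theta))\,x'(\theta) \,=\, \frac{2\cos\theta}{\sqrt{4-4\sin^2\theta}} \,=\, \frac{2\cos\theta}{2\,|\cos\theta|} \,=\, 1,$$
where the last step uses $\cos\theta > 0$ on $(-\pi / 2,\pi / 2)$. So $\Phi'(\theta) = 1$ there, which gives $\Phi(\theta) = \theta + C$.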
Lastly, we say goodbye to $\theta$ by noting that $\theta(x) \,=\, \arcsin(x/2)$ and, again, that $\Phi(\theta(x)) = F(x)$. Hence $F(x) = \arcsin(x/2)$, up to an additive constant.
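If you would like a sanity check, here is a small numerical sketch in Python. It is not part of the argument. It assumes the normalization $F(0) = 0$ (the antiderivative is only determined up to a constant), and the helper names `integrate_midpoint`, `F_numeric` and `F_substitution` are made up for illustration.

```python
import math


def f(x):
    """The integrand f(x) = (4 - x^2)^(-1/2), defined for -2 < x < 2."""
    return 1.0 / math.sqrt(4.0 - x * x)


def integrate_midpoint(g, a, b, n=20000):
    """Approximate the integral of g from a to b with the midpoint rule.

    Works for b < a as well, because the step h is then negative.
    """
    h = (b - a) / n
    return h * sum(g(a + (k + 0.5) * h) for k in range(n))


def F_numeric(x):
    """Antiderivative of f obtained by direct integration, with F(0) = 0 (assumed)."""
    return integrate_midpoint(f, 0.0, x)


def F_substitution(x):
    """Antiderivative predicted by the substitution x = 2 sin(theta)."""
    return math.asin(x / 2.0)


# Compare the two at a few points strictly inside (-2, 2).
for x in (-1.5, -0.5, 0.0, 1.0, 1.9):
    print(x, F_numeric(x), F_substitution(x))
```

The two columns should agree to several decimal places. The sample points stay away from $\pm 2$ on purpose, since $f$ blows up at the endpoints and the midpoint rule converges slowly there.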
I tried to do all this while avoiding any interpretation of 'dx'. Of course it is fine to think of $dx$ as the width of very skinny rectangles. But in the COV/substitution, you must think of the two intervals $-2 < x < 2$, $-\pi/2 < \theta < \pi/2$ and some differentiable bijection between them.