I have this confusion while studying indefinite integrals. Is dx a derivative of x or a notation?
If it is just a notation,how can we explain that the transformation from dx to du satisfies the operational law of differentiation?
I have this confusion while studying indefinite integrals. Is dx a derivative of x or a notation?
If it is just a notation,how can we explain that the transformation from dx to du satisfies the operational law of differentiation?
Let's backtrack a bit and think about this object we call $$A = \int f(x)\ \mathrm dx,$$ where $f$ is a given function. If you are studying integration at the moment you must have seen a picture that looks something like this:
That is, when we find the integral, we are essentially chopping off all of the little bits that lie under the curve, finding their individual areas, and summing them all up. Each of those little bits have length $\mathrm dx$; the symbol $\mathrm d$ really just specifies that together with the thing after it (in this case $x$, hence $\mathrm dx$), is an indefinitely small quantity - as tiny as practically possible. Clearly, if we break $x$ into very little bits $\mathrm dx$, and sum them all up together, we get $x$. How do we express this sum? The symbol $\int$. So we do write $$\int \mathrm dx = x.$$
Here's a hopefully helpful example. Think of your entire life; let's say you're exactly 13 years old at the moment; not a second older. This means that you've lived for 409,968,000 seconds in total. If you like, you can think of every passing second as a little bit, and your entire life as being composed of 409,968,000 such little bits.
Now let's go back to $A$. Think about the curve above. We have broken the line from $a$ to $b$ into little bits each of length $\mathrm dx$, and by doing so we have formed the little rectangles going from the $x$ axis up to the curve $f$. Thus the height of these rectangles is whatever $f$ is for that $x$; i.e. $f(x)$. The area of a rectangle is the product of its height and width, so the area of each of the really thin rectangles we have created is $\mathrm dA = f(x) \cdot \mathrm dx$, where we've again written $\mathrm dA$ because we expect this area to be really small. The total area under the curve is the sum of each of these tiny little areas, namely $$A = \int \mathrm dA = \int_a^b f(x)\ \mathrm dx.$$
As you learn more mathematics, you will see various notions and interpretations for the quantity $\mathrm dx$, which is more formally called the differential element for some $x$ associated with a certain space. In fact, if you know some linear algebra, it is true that the differentials $\left(\mathrm dx_1\right)_p, \ldots, \left(\mathrm dx_n\right)_p$ at a given point together form a basis for the vector space of linear maps from $\mathbb{R}^n$ to $\mathbb{R}$, and so if a function $f :\mathbb{R}^n \to \mathbb{R}$ is differentiable on all of $\mathbb{R}^n$, the differential at $p$, $\mathrm df_p$, is a linear combination of these basis elements, that is, $$\mathrm df_p = \sum_{i=1}^n D_if(p)\left(\mathrm dx_i\right)_p \implies \mathrm df = \frac{\partial f}{\partial x_1} \cdot \mathrm dx_1 + \ldots + \frac{\partial f}{\partial x_n} \cdot \mathrm dx_n.$$ This is a bit of a roundabout way to show that $$\mathrm df = \frac{\mathrm df}{\mathrm dx} \cdot \mathrm dx.$$
It is merely a notation. What is meant by this expression is just a function such that its derivative is $f$, and this function should be evaluated at $x$. If you include other variables, it is important to distinguish which one is the argument of the function, in a sense what variable is being integrated.
The notation for indefinite integral is rather weird and confusing in my opinion, and I try not to use it. But it is popular in calculus courses.
The $dx$ occurring in the integral is not merely a notation. In the historical infinitesimal calculus, one would partition the domain of integration $[a,b]$ into, say, equal subintervals of infinitesimal length $dx$, with partition points $a=x_0, x_1,\ldots, x_H=b$ where $H$ is an infinite (in modern terminology, unlimited) integer, and $dx=\frac{b-a}{H}$. Then form the corresponding sum $S_H=\sum_{i=1}^H f(x_i) dx$ of infinitely thin rectangles of varying height $f(x_i)$. Such a sum is not exactly the integral; the integral is obtained by discarding an infinitesimal term, or more precisely by taking the standard part of the sum $S_H$: $$\int_a^b f(x) dx = \text{st}\Big(\sum_{i=1}^H f(x_i) dx\Big).$$ As you see, retaining the $dx$ as part of the notation for the definite integral is quite appropriate. Naturally, it is retained for indefinite integrals, as well. It is also essential if one is to get reasonable-looking formulas for change of variables in integrals.
Today's non-infinitesimal approaches cannot explain the notation in this way, and therefore tend to resort to explanations in terms of 1-forms. In modern infinitesimal calculus, $dx$ is naturally an infinitesimal occurring in the Riemann sum $S_H$ (and therefore naturally also in the integral). For details, see Keisler's textbook for infinitesimal calculus.