7

During integration by substitution we normally treat infinitesimals as real numbers, though I have been made aware that they are not real numbers but merely symbolic, and yet we still can, apparently, treat them as real numbers. For instance, consider we want to integrate the expression $3x(x^4+1)^3$. A common way to do this is to let $u=x^4+1$, where $\frac{du}{dx}=4x^3$, and thus $du=4x^3dx$ which is appropriately used in our substitution to obtain $\int3x(u)^4 du$, and then we simply directly integrate this new integrand. However, while I understand the process and why we do it in such a manner, I am perplexed as to why we can still rigorously treat the infinitesimals as real numbers. So, my question is if anyone can elaborate on exactly why it is logically rigorous to treat infinitesimals as real numbers during substitution for integration.

(Note: My question does not concern as to what "dx" means in integration simply because my question is defined in the prospect of treating infinitesimal derivatives as ratios specifically in integration by substitution, where other questions do not specifically address. )

  • 3
    When doing computations of this sort you are manipulating terms according to certain rules. This has nothing to do with the "fine structure" of ${\mathbb R}$, including or excluding infinitesimals. – Christian Blatter Jul 05 '15 at 08:22
  • @ChristianBlatter Yes, but my question was regarding the justification of these rules, which I believe does include infinitesimals. – Reinhild Van Rosenú Jul 05 '15 at 09:13
  • You can easily explain integration by substitution without mentioning infinitesimals, just note that an antiderivative of $ f (u (x)) u'(x ) $ is $ F (u (x)) $, where $ F $ is an antiderivative of $ f $. We've merely used the chain rule in reverse. – littleO Jul 05 '15 at 10:30
  • 1
    This is not specific to integration but to differentials in general. –  Jul 05 '15 at 10:38
  • 1
    So, my question is if anyone can elaborate on exactly why it is logically rigorous to treat infinitesimals as real numbers There is nothing necessarily non-rigorous about infinitesimals. The most common way of making them rigorous is through non-standard analysis (NSA). In NSA, infinitesimals obey all the same elementary axioms of arithmetic that the real numbers obey. –  Jul 05 '15 at 13:22

4 Answers4

11

The issue is that there is a whole bunch of measure theory hidden behind those "legitimate" calculations. If you just learned how to integrate elementary functions, you have too much to learn before you get there.

The key in this is the Radon-Nikodym Theorem, which says that if you have two measures $\mu,\nu$ on a measurable space $X$ such that $\mu$ is absolutely continuous with respect to $\nu$, then there exists a unique density function (which we denote by $\frac{d\mu}{d\nu}$) such that for any function $g$ integrable with respect to both $\mu$ and $\nu$, $$ \int_X g \, d\mu = \int_X g \frac{d\mu}{d\nu} \, d\nu. $$ The function $\frac{d\mu}{d\nu} : X \to \mathbb R$ is called the Radon-Nikodym derivative of $\mu$ with respect to $\nu$.

So in other words, in the rigorous treatment, the infinitesimals are not treated as infinitely small quantities, but rather as measures; while doing a change of variables, it can be shown that the Radon-Nikodym derivative you obtain is in fact the derivative of the function you use to change variables (i.e. in $du = f'(x) dx$, $\, f'(x) = \frac{du}{dx}$ where $du$ and $dx$ are two measures on the real line and $u = f(x)$ ; $f$ must be a diffeomorphism).

Now let me assume that this was not satisfactory for you; there is also the field of mathematics called non-standard analysis, which defined hyperreal numbers; I suggest you look at the Wikipedia page on this for more details. It allows the treatment of infinitesimals as quantities so you can play around with them.

Hope that helps,

  • Thank you, I guess I I have to do a bit of research, but can you roughly state at what year of University (pure mathematics) is one likely to encounter measure theory and thus u der stand what you have said regarding it? – Reinhild Van Rosenú Jul 05 '15 at 09:08
  • 1
    It depends on the universities, but look at your universit's curriculum to check at what level they teach Measure theory. I know that in Canada it's a graduate course, but where I'm studying at right now (in Berlin) it's a second year course. Normally after a first course in analysis and Riemann integration you are ready for that. – Patrick Da Silva Jul 05 '15 at 09:47
3

The justification of substitution method for finding indefinite integrals is the chain rule of derivative. Suppose $F(x)$ is anti-derivative of $f(x)$ so that $$F(x) = \int f(x)\,dx$$ and $F'(x) = f(x)$. Now if we put $x = g(t)$ then and apply chain rule we get $$\frac{d}{dt}\{F(g(t))\} = F'(g(t))g'(t) = f(g(t))g'(t)$$ and hence we get $$F(g(t)) = \int f(g(t))g'(t)\,dt$$ or the rule of substitution for indefinite integration.

As you can see that it has nothing to do with infinitesimals or differentials. In order to memorize this rule we sometimes give informal version: Let $x = g(t)$ so that $dx = g'(t)\,dt$ and hence $$\int f(x)\, dx = \int f(g(t))g'(t)\,dt$$ So this kind of language is used only to help remember the rule.

It is better not to give the $dx$ in both $d/dx$ and $\int\,dx$ notation a proper meaning. They are best suited for aid in remembering rules of calculus. Although it is possible to give a formal meaning to these symbols (what we call differentials) but it does not lead to any concept which is more useful than derivatives or integrals.

1

Things work once all definitions are made:

1. We define what the differential is: $$df = f'(x) \, dx$$

2. So all rules of derivative hold, including the change of variable (chain rule): $$f'(g(x)) = f'(g) \cdot g'(x) \Rightarrow df = f'(g) \cdot g'(x) \, dx$$ or equivalently: $$\frac{df(g(x))}{dx} = \frac{df(g)}{dg} \cdot \frac{dg(x)}{dx}$$

3. We define (indefinite) integration as inverse of derivation operator. $$(Integral \, (f(x)))' = f(x)$$

4. We define what (definite) integral is: the sum of area of rectangles that upper and lower bound of the function. The smaller is width of rectangle the better is the precision of calculation of integral. With the width $\to 0$ we can call it $dx$ to indicate it's small. We indicate the operator as $$\int_a^b f(x) \, dx$$

5. Using the fundamental theorem of calculus we demonstrate $Integral(f(x))$ operator gives same results of $\int_a^x f(t) dt$. We will call of operator $\int f(x) \, dx$ to indicate we interested in function and don't care about integration r.

Now theoretic machine is over and we can demonstrate the change of variable formula: $$ f(g) = f(g(t)) \Rightarrow \int df(g) = \int df(g(t)) \Rightarrow \int f'(g) \, dg = \int f'(g) \cdot g'(t) \, dt $$

Blex
  • 1,744
1

It is more appropriate to speak about differentials than infinitesimals; in some books they are treated as linear maps.

There are rigorous computation laws involving differentials, such as

$$d(u+v)=du+dv,$$ $$d(uv)=udv+vdu,$$ or $$\frac{du}{dv}\frac{dv}{dt}=\frac{du}{dt}.$$

These are established in the theory of derivation.

It turns out that some look like operations on reals, which probably why this notation has been adopted. By you may not blindly extrapolate to other operations (say like $\sqrt{du^2}=|du|$ ?!)