33

I'm currently learning calculus (high school senior), and I am not comfortable with the idea that the limit of the sums of rectangles actually converges to the area under the curve. I know it looks like it does, but how do we know for sure? Couldn't the tiny errors beneath/over the curve accumulate as we add more and more rectangles? What's troubling me is the whole Pi = 4 thing with the staircase approximating a circle pointwise, and how it's wrong and the perimeter of the staircase shape does not approach the circumference of the circle, even though pointwise it does approach a circle. So how are the increasingly many, increasingly small errors in Riemann sums any different? How do we know the error in each step decreases faster than the number of errors increases? I would really like to see a proof of this.

Thanks so much!

  • 14
    This is usually because the real area is bounded from above by the Riemann upper sums and from below by the lower Riemann sums, and in the favourable cases (the so-called Riemann integrable functions) , these upper and lower sums converge to the same limit, so we can apply the squeeze theorem. – Bernard Mar 01 '20 at 00:17
  • 1
    Oh okay, that makes sense, but how do we know the upper and lower Riemann sums converge? – Peter Caphulburg Mar 01 '20 at 00:22
  • 9
    It does not always converge. This depends on the function. It works for continuous functions on a bounded interval, or for monotonic functions.. – Bernard Mar 01 '20 at 00:24
  • That's interesting. Do you think the formal proof of this is understandable at my level? – Peter Caphulburg Mar 01 '20 at 00:42
  • 2
    If I recall correctly: the whole “$\pi=4$” thing has to do with arc length not necessarily behaving regularly under a limit. That is to say a sequence of curves may converge pointwise to another curve but their arc-lengths may not. – Nap D. Lover Mar 01 '20 at 01:47
  • @NapD.Lover: That's hardly a satisfying explanation, it's literally just a rephrasing of the paradox – BlueRaja - Danny Pflughoeft Mar 01 '20 at 09:12
  • If I remember well, it relies, for continuous functions, on Weierstraß' theorem that a functionwhich in continuous on a closed and bounded interval $[a,b]$ is bounded and attains its bounds on the interval. Not sure the proof of this theorem is really easy at the high-school level, but its meaning is quite intuitive. For monotonic functions, whether continuous or nt, the proof is straightforward (uses the monotone convergence theorem). – Bernard Mar 01 '20 at 10:10
  • I wonder if you'd like reading the book Calculus by Spivak to see a rigorous development of this material. – littleO Mar 02 '20 at 18:05
  • Interesting, I'll look into that book and see if I can get it. – Peter Caphulburg Mar 03 '20 at 00:46

4 Answers4

29

This actually is a very good question and something that really takes a two-prong answer to fully do it justice.


PART 1.

You are right to be skeptical and, I'd say, this is yet another of those times where that the "standard" maths curriculum and expositions do a really great job of making something easy, hard. You should trust your gut here, not your teacher.

The answer is simple: no, you cannot "prove" this, unless you have an independent formal definition of "area" that is separate from the Riemann integral and comprehensive enough to handle these situations on its own. Such a definition can be made: it's called Lebesgue Measure, but it takes a bit more mathematical machinery than is available at this point in exposition to make it cook.

Basically, Lebesgue measure is a function that takes a single input argument which is an entire set of points on the plane, i.e. $S$, interpreted as a solid area and not merely the boundary, and tells you its area, $\mu(S)$. This $S$ would be, in your case, the solid plane figure that is colored in on the graphs in your calculus textbook as being the "area under the curve". There is no integrals involved in its definition though, as I'd say, we'd need to expose a considerable amount of new machinery to build this. But if you do that, then take my word that you can prove that

$$\mu(S) = \int_{a}^{b} f(x)\ dx$$

where the right hand is a Riemann integral and the $S$ is as I described for this particular situation, when said Riemann integral exists.

ADD: As @Paramanand Singh mentions in the comments, there are simpler ways to define the area that may be more digestible at this point, though they do not cover as many cases as the Lebesgue measure. The Borel measure and Jordan pseudo-measure are two such options and I could attempt to describe them here if you wish or, you could ask another question in the vein of "What is a simple, integral-free definition of the area of a complicated plane figure that is digestible at or near the level of introductory calculus?" and I could then answer it with one or both of these.


PART 2.

This, of course, leads one to what a better way to introduce the integral should be, given we cannot at this stage do the necessary proof. And moreover, even if we could, then it would lead one to scratch one's head as to just why exactly we care to be creating this idea of "Riemann integral" in the first place when we already have a perfectly-good working construct for area.

And so, what I'd say is that a superior approach is to say that the Riemann integral is an explicit method to reconstruct a function from its derivative and, to make this clearer, we also need a better intuitive understanding of what a "derivative" means beyond the "tangent line" business that, while actually not bad at all, is itself also ruined by a poor explanation, too and which I could add still more details to get into, but I want to try and keep focus on the problem at hand. As Deane Yang mentioned in what was one of the posts that had a great influence in shaping my present attitude towards maths and especially maths education, here:

a better intuitive model for the "derivative" is that it is a kind of "sensitivity measurement": if I say that the derivative of a real-valued function of a real variable, $f$, has the value at the point $x$ of $f'(x)$, what that means intuitively is that if I "wiggle" $x$ back and forth a little bit, i.e. $\Delta x$, back and forth about this value, and I watch then the output value of $f$, i.e. $f(x)$, as though $f(x)$ were some instrument with a readout and $x$ a dial we could turn back and forth, then this $f(x)$ will likewise "wiggle" some other amount, i.e. $\Delta y$, and that

$$\Delta y \approx f'(x)\ \Delta x$$

provided $\Delta x$ is small - the accuracy of the approximation becoming as good as we like it if we make $\Delta x$ suitably smaller than whatever value we've been using so far: hence why we need to pass to a limit, a concept that, once more, can use some further elucidation. Or, to turn it around, $f'(x)$ is the "best" number to represent how much the output changes proportionally to the input, so long as we keep the input change small enough.

The Riemann integral, then, is the answer to this question:

  • Give me a procedure that, if we are first given the derivative, $f$, to find a function $F$ that has it as its derivative, with the initial information that $F(a) = 0$, for some selected point $a$.

That is, it is in effect a constructive way to solve what in differential equations terminology would be called the initial-value problem, or IVP,

$$\frac{dF}{dx} = f, F(a) = 0$$

that proceeds as follows.

We are given the only starting information that $F(a) = 0$, and that $F' = f$. So suppose we are to construct the value of $F$ at a new point $b$ for which $b > a$. How may we try this, given what we've already discussed?

So now, think about what I just said about the meaning of the derivative, and ask yourself this question:

I know that $F'$ here is how sensitive it is to a small change. So suppose I were to now do a Zeno-like manoeuvre and hop a small amount $\Delta x$ from $a$ rightward along the real number line to $a + \Delta x$. What then should we guess for $F(a + \Delta x)$?

Well, if you got what I just mentioned, then you should come to that, since $F'(a)$ is proportionately how much $F$ will respond to a small change in its input around $a$, and what we are doing is exactly that: to make such a small change from $a$ to $a + \Delta x$, then we should likewise shift $F(a)$ to $F(a) + (F'(a) \Delta x)$, so that

$$\begin{align} F(a + \Delta x) &\approx F(a) + [F'(a)\ \Delta x]\\ &= F(a) + [f(a)\ \Delta x]\end{align}$$

. And then, we can do the same thing, and make another small "wiggle" from $a + \Delta x$ to $[a + \Delta x] + \Delta x$ (i.e. $a + 2\Delta x$), and we get

$$\begin{align}F([a + \Delta x] + \Delta x) &\approx F(a + \Delta x) + [F'(a + \Delta x)\ \Delta x]\\ &= F(a) + [f(a) + \Delta x] + [f(a + \Delta x)\ \Delta x]\end{align}$$

and if you continue on this way all the way until we get to $b$, or at least as close as possible, you see we have

$$F(b) \approx \sum_{n=0}^{N-1} f(a + n\Delta x)\ \Delta x$$

or, letting $x_i := a + i\Delta x$,

$$F(b) \approx \sum_{i=0}^{N-1} f(x_i)\ \Delta x$$

. Moreover, we can generalize this a bit further still to allow for irregular steps, which increases the flexibility a bit for, say, mildly discontinuous input functions $f$, and so we get

$$F(b) \approx \sum_{i=0}^{N-1} f(x_i)\ \Delta x_i$$

and we're almost there, all it takes now is a limit to get to...

$$F(b) = \lim_{||\Delta|| \rightarrow 0} \sum_i f(x_i)\ \Delta x_i$$

and if we introduce a bit of notation for this new concept now...

$$\int_{a}^{b} f(x)\ dx := \lim_{||\Delta|| \rightarrow 0} \sum_i f(x_i)\ \Delta x_i$$

which is...?

And by the way, is the Fundamental Theorem of Calculus much of a "surprise" now, or almost tautological, something that was by design, not a mystery to be solved? (This is pattern I also find crops up elsewhere where the best motivating reason for something is put after and not before - e.g. Cayley's theorem in abstract algebra.)

That is, the real surprise is not that we can use the Riemann sum to find an antiderivative - that's its whole point - but that this sum also can describe an area and that is, indeed, much less trivial to prove.

  • For the IVP, you have $\dfrac{\mathrm dF}{\mathrm dx} = f$. (Great post by the way, +1!) – KM101 Mar 01 '20 at 01:39
  • 3
    Whoa. Your post blew my mind. This sentence was where it clicked for me: "the Riemann integral is an explicit method to reconstruct a function from its derivative." From where I'm at right now with my current knowledge, that makes so much more sense than the area under the curve, and is actually beautiful. I look forward to learning the proof that the sum agrees with the area some day, and it is satisfying to me to know that area defined independently exists and is equal to the definition by the integral. Thank you so much for taking the time to write this out, I really appreciate it. – Peter Caphulburg Mar 01 '20 at 02:11
  • 1
    @Peter Caphulberg : THANKS!!!! I am happy and glad that my insights really are as useful and helpful as they seem they would be in my mind! :) ❤️#Meep. – The_Sympathizer Mar 01 '20 at 02:55
  • 1
    @Peter Caphulberg : One thing I just realized I should add to that: Of course, the equivalence between Riemann integral and area only holds entirely directly when the function is non-negative and, indeed, in measure theory, we have to account for that more explicitly, as one might expect, in connecting measure to integral. – The_Sympathizer Mar 01 '20 at 02:57
  • 1
    @Peter Caphulberg : Moreover, from this vantage point, one should take the "rectangles" demonstration as showing that you can use an integral to find areas, but not as the fundamental meaning of the integral. Indeed, the fact of the positive/negative business, itself, is kind of a gentle pointer in that direction :) – The_Sympathizer Mar 01 '20 at 02:58
  • 2
    @Peter Caphulberg : And hence also why I am "The Sympathizer" - because I sympathize and empathize with those who struggle esp. given the obstacles I ran into learning this stuff and I could feel such ❤️empathy❤️ for other little ❤️Peeps❤️ and want to create quality resources they can use to get what I couldn't. ❤️#meep. – The_Sympathizer Mar 01 '20 at 03:03
  • 1
    Oh okay, thanks for the clarification on the equivalence between the Riemann integral and area. So if I understood correctly it's like the integral is a tool, and one of its uses is finding the area under the curve, but the fundamental meaning is a way to reconstruct a function from its derivative. That's a really cool idea. – Peter Caphulburg Mar 01 '20 at 03:47
  • 1
    @Peter Caphulburg : The integral is a tool. Indeed, as are many other mathematical devices like it. Just like addition: addition is a tool you can use to figure out how much stuff you'll have if you have two piles of stuff and put them together without having to measure the third pile. Numbers are tools to state in precise terms how big or small those piles are. Eventually, you can build enough tools to describe the deepest secrets of the universe, to create life-saving medicines, to pose and ponder profound philosophical questions, to make beautiful artwork (look up "fractal flame"), or ... – The_Sympathizer Mar 01 '20 at 04:04
  • 2
    ... perhaps the rather more profane one: get rich :) And if you were wondering "gee why am I learning maths" - now you know! And the problem is the teachers, not Maths! – The_Sympathizer Mar 01 '20 at 04:05
  • Wow, thanks so much! I hadn't thought about it that way before, that mathematical devices are tools that describe deep concepts in the world, like finding the size of the third pile when you add two piles of stuff or finding your speed at any given time. I guess that's why people say math is the language of the universe; this tool point of view really makes math seem like a language to me. I just looked up fractal flames, they're beautiful! They look complicated to create, I wonder what types of mathematics are involved there. – Peter Caphulburg Mar 01 '20 at 04:30
  • 1
    @Peter Cpahulberg : you're welcome and I'm feeling a little teenieweeniebittie embarrasseyticklepinkeyinsidefromallthepraiseypraisey:(:):(:)):):) – The_Sympathizer Mar 01 '20 at 07:30
  • 1
    I hope and want that I will be able to teach many to see Maths this way and to make it a subject that is more loved. Of course, they say not everyone can be a Gauss but neither can everyone be a Picasso either and yet for some reason more people seem to love Art than Maths so I find that strange that some seem to use that as a way to suggest something different about Maths... – The_Sympathizer Mar 01 '20 at 07:32
  • 3
    +1 for excellent answer. Two points I wish to add: for a simplified treatment of Area one can replace Lebesgue measure with Jordan content which makes the connection with Riemann integral almost obvious. Second is about your comments regarding use of maths. All forms of human study have practical value and this is especially the case with science, math. But there is an inherent intellectual enjoyment in these things which is the major motivation. The enjoyment is almost comparable to what we get when we read a good novel, watch a good movie or listen to a nice song. – Paramanand Singh Mar 01 '20 at 07:53
  • @Paramanand Singh: Added mention for simpler area constructs. – The_Sympathizer Mar 02 '20 at 11:13
  • I'll ask a new question with the title you suggested. I'm looking forward to learning about an integral-free definition of areas :) – Peter Caphulburg Mar 03 '20 at 00:48
  • @Peter Caphulburg : Thanks. – The_Sympathizer Mar 03 '20 at 02:07
  • @PeterCaphulburg If you've made the new post, could you add a link in the original post? – Alice Ryhl Mar 03 '20 at 18:34
  • You can apply the Riemann rectangle sum method to figures whose areas are known, such as triangle, rectangle, circle, parabola, etc., and you will find that you get the correct formula for area. That proves nothing, but should give you a good feeling. – richard1941 Mar 09 '20 at 20:53
9

The Riemann integral (really Darboux' definition, as that is what is usually taught as "Riemann integral") is a purely analytical construction. It does not have any relation to "areas". What is important is that it satisfies some of the properties the "area under the curve" has (where it makes sense).

Sure, you can interpret the integal as the area below a curve, but Riemann's integral was defined precisely to handle (some) cases where that makes little sense, in a rigorous manner (Newton's, Leibnitz' "definitions" were quite handwavy). Note also that later definitions of integral (Lebesgue's, Stieltjes') have little or no visual connection to "areas", are even defined for cases where area makes no sense whatsoever.

J. W. Tanner
  • 60,406
vonbrand
  • 27,812
  • 1
    That's really cool. I have enjoyed learning about how they are related to distance and physics problems as well. I'm glad to hear that integrals have interpretations other than areas under curves. What are those cases where area makes no sense? – Peter Caphulburg Mar 01 '20 at 00:51
  • 4
    @PeterCaphulburg Integrals have all kinds of useful applications if you're a scientist or engineer. Virtually none of the practical applications I've ever run across actually have anything to do with area. For example, here's a question from someone trying to learn electromagnetic theory: https://math.stackexchange.com/questions/916569/intuitive-explanation-for-integration – David K Mar 01 '20 at 01:58
  • @PeterCaphulburg, one is accustomed to functions that are "nice" (continuous, continuous derivatives everywhere, ...). However, there are strange beasts too. What should be the area if the function contains a jump? It doesn't "enclose" anything if it is 1 to just before 1 and 2 afterwards. What if it has a "hole" (0 if $x = 1$, else 2)? Or it just spikes out of sight, like $x^{-2}$ at zero? Another classical example is $f(x) = 0$ if $x$ is rational, 1 if $x$ is irrational. Or look up Thomae's function. – vonbrand Mar 01 '20 at 04:03
3

In most, if not all, Calculus textbooks, the area under the curve is actually DEFINED to be the appropriate definite integral.

3

I think that Bernard made a good point in the comments under your question. So let me compile that into a short answer.

The trick is in working not with one, but with two slightly different Riemann sums - the lower and the upper Riemann sum. Consider the following figure:

enter image description here I think we can agree that the lower Riemann sum $\check\Sigma$ (in the left figure) underestimates the actual area under the curve, and that the upper Riemann sum $\hat \Sigma$ (in the right figure) overestimates the actual area under the curve.

So we can agree that the actual area $A$ under the curve is somwhere between these sums:

$$\check\Sigma\le A\le \hat\Sigma.$$

Now, if we make the Riemann sum finer and finer (adding more rectangles, and making them thinner), the hope is that the lower and upper value come closer and closer to each other, and converge to a common value. If that would be the case, we could agree that this common value must be $A$, as $A$ was at all times trapped between these both sums.

So how do we know that the lower and the upper Riemann sum converge to the same value? The answer is: we don't! They do, in general, not converge to the same value! But they do for some functions, and these functions are then called Riemann integrable. These are the functions for which the Riemann sum is a valid tool to compute the area under the curve.

So the problem was now shifted to the question: "which functions are Riemann integrable?". And this is indeed a non-trivial question. Luckily, most functions that appear in everyday applications are in fact Riemann integrable. For example, continuous functions (on bounded intervals) or monotone functions are Riemann integrable, and one can prove this rigorously. This is usually done in an introductory analysis course for mathematicians. If you are interested in a proof: Wikipedia contains a proof of the fact that a functions is Riemann integrable (on a compact interval) if and only if it is continuous almost everywhere.

M. Winter
  • 29,928