25

QUESTION: What are some simple math problems whose answers are highly unintuitive, and what makes them so?

There are plenty of unintuitive and frankly baffling results in math, like the Banach-Tarski paradox and this crazy MSE question which uses the axiom of choice to predict real numbers. However, these are pretty esoteric, and a layman might have trouble even understanding what exactly the question is asking. I’m more interested in examples like the Potato paradox:

Fred brings home $100$ kg of potatoes, which (being purely mathematical potatoes) consist of $99\%$ water. He then leaves them outside overnight so that they consist of $98\%$ water. What is their new weight? The surprising answer is $50$ kg.

I think I can explain why this answer seems unintuitive. Our intuition tells us that a small change in the water percentage should result in a small change in the mass of the potatoes. However, this heuristic is misleading in this case, in part because of the fact that $1/x\to \infty$ as $x\to 0$ and $1/x$ makes large “jumps” in value for $x$ close to $0$.

What are some other examples of simple problems with unintuitive answers? (I expect that there are plenty of examples that have to do with probability, since humans have terrible probabilistic intuition, and plenty of examples involving infinity, since people have a hard time conceptualizing the infinite.)

Also, please try to articulate exactly why you think your problem has an unintuitive answer, as I’ve attempted to do for the Potato paradox.

zhuli
  • 2,561
Franklin Pezzuti Dyer
  • 39,754
  • 9
  • 73
  • 166
  • Note: This question is certainly related, but most of the “unintuitive results” used as examples are rather convoluted. I’m looking for simpler ones like the Potato paradox. – Franklin Pezzuti Dyer Jun 05 '20 at 20:01
  • Unless you can give a mathematical definition of "intuistic" and "unintuistic", this is not a mathematical question, and doesn't belong here. –  Jun 05 '20 at 20:09
  • 5
    “$99:%$” by what? volume? amount [of substance]? That makes a big difference in whether or not it’s “intuitive.” If you’re talking about volume, I don’t find this shocking personally. Fun question though! +1 – gen-ℤ ready to perish Jun 05 '20 at 20:11
  • 2
    The "potato paradox" is not precisely defined. I wonder if there's a nicer formulation? – Mateen Ulhaq Jun 06 '20 at 04:22
  • The hydra game: see https://math.stackexchange.com/questions/2040811/what-are-some-counter-intuitive-results-in-mathematics-that-involve-only-finite/2042504#2042504. – Martín-Blas Pérez Pinilla Jun 06 '20 at 06:36
  • (In some sense) There are many more real numbers than rational numbers even though there is some rational number arbitrarily close to any real number and rational numbers seem to be "everywhere" – qwr Jun 06 '20 at 07:47
  • In fact it's not intuitive to me (or the ancient Greeks) at all that irrational numbers even exist. – qwr Jun 06 '20 at 07:49
  • 2
    @qwr: Even worse: Between any two irrational numbers you'll find a rational number. And yet there are far less rational than irrational numbers. – celtschk Jun 06 '20 at 10:07

7 Answers7

22

Here's another problem about the unintuitive effect of $\frac1x$.

You want to drive point A to point B and back at an average speed of 60 mph. However, on the way from A to B, there was traffic, which slowed you down to 30 mph. How quickly do you have to drive from B to A so that your average speed is 60 mph (over the entire round trip)?

A reasonable first guess is 90 mph, and then you might wonder if the true answer is a bit different, but actually the answer is a lot different:

It's impossible! If A and B are $\ell$ miles apart, then an average speed of 60 mph means going the $2\ell$ miles from A to B and back in only $\frac{2\ell}{60} = \frac{\ell}{30}$ hours. However, going from A to B at 30 mph already took $\frac{\ell}{30}$ hours, so the return trip would have to be done in zero time.

Exponential functions have an even more unintuitive effect (though we're currently all getting a crash course in those), and there's the traditional problem:

A population of algae is introduced into a lake on day 1. The algae grows very quickly, doubling in population (and in area covered) every day. On day 30, half of the lake is covered. At this rate, when will the algae cover the entire lake?

Maybe our first guess (because we expect all functions to be linear) is day 60 or 59 or something, but actually

The answer is day 31; doubling "half of the lake" just once gives us the entire area of the lake.

It's traditional to give three examples, so here's the birthday paradox. The math here is a bit fancier (though I'm giving the version that requires less calculation), but the statement is easy for anyone to understand:

A professor teaches a lecture class of about the same size every year. The roster lists everyone's birthday. The professor notices that on average, there is one pair of students per year with the same birthday. About how large are the professor's lectures?

There are $365$ days in most years, so we might expect that a sizable fraction of the year needs to be covered. However, the answer is only:

About $27$ or $28$ students per class. With $27$ students, there are $\binom{27}{2} = 351$ pairs of students; with $28$ students, there are $\binom{28}{2} = 378$. To get the average number of pairs that share a birthday, divide by $365$ (under the assumption that birthdays are uniform and February 29 doesn't exist, which is not far from the truth).

This paradox gets more surprising if we replace "birthday" with some other statistic that's uniformly spread over even more values, but I can't think of one that's also a reasonable piece of data for the professor to collect from the students.

Misha Lavrov
  • 142,276
  • I like the first example a lot. The second example didn’t throw me at all though personally; it’s just recursion. +1 – gen-ℤ ready to perish Jun 05 '20 at 20:13
  • Intuitions vary, and maybe the second example is less likely to surprise people visiting Math StackExchange. But that question is so often presented as a "gotcha!" question that I feel like many people do get surprised. – Misha Lavrov Jun 05 '20 at 20:14
  • 1
    Usually the question is asked the other way round: After a given time, half of the lake is covered, when is the full lake covered? That way it is far more effective. Also nice are cases where the covered percentage is small and the time so far is long; for example, after three months, 6.25% of the lake is covered, when will the full lake be covered? (6.25% = 1/16, therefore it will take just 4 more days). – celtschk Jun 05 '20 at 20:21
  • @celtschk Those cases are nice! I'm not going to mess around with small percentages, but I've rephrased the question to ask about going from 1/2 to 1. – Misha Lavrov Jun 05 '20 at 20:24
  • How do you calculate the pairs in the last example? – d-b Jun 06 '20 at 05:38
  • 1
    Which part are you asking about? $\binom{27}{2}$ is the binomial coefficient, and is equal to $\frac{27 \cdot 26}{2}$; we choose one student in $27$ ways, a second student in $26$ ways, and divide by $2$ to account for switching their order. – Misha Lavrov Jun 06 '20 at 06:06
  • I've never found the first one to be that tricky (compared to the potato example). I just imagine that the trip is 60 miles round trip, 30 one way. If I want to be 60 mph average I need to make the round trip in an hour, but I already spent an hour getting there (30 miles at 30 mph) so there's no way to beat an hour total anymore. – Alex Jones Jun 06 '20 at 06:14
  • @AniruddhaDeb That's true, but it doesn't actually change the calculation all that much. Using the frequency data you link to, the probability that two randomly chosen people have the same birthday is... approximately $\frac{1}{365.024}$. (The probability that two randomly chosen people from their study have the same birthday is approximately $\frac{1}{365.057}$, which is slightly lower because you can't choose the same person twice, eliminating a few cases that would guarantee a match.) – Misha Lavrov Jun 06 '20 at 13:39
9

Expected first entry time of a Brownian motion!

Let $B$ be a Brownian motion with start in $0$,

$a \in \mathbb{R}_{> 0}$ and $\tau_a$ be the first entry time of $a$, i.e.

$$ \tau_a := \inf\{t \geq 0 \mid B_t = a\}.$$

The value $\tau_a$ is almost surely finite, i.e. almost every path of the Brownian motion hits $a$ in finite time, but the expected value of $\tau_a$ is actually infinite!

You can proof this by considering $\inf\{t \geq 0 \mid B_t \in \{a, -a\}\}$.

image credit


  • 2
    The same is true for a discrete (symmetric) random walk on the number line, going from $k$ to $k+1$ or $k-1$ with equal probability at each time step. Any hitting time is finite with probability $1$, but has an expected value of $\infty$. – Misha Lavrov Jun 05 '20 at 21:25
  • @MishaLavrov I would upvote that as an answer – gen-ℤ ready to perish Jun 05 '20 at 21:44
  • 2
    In other words, even at a roulette table with a 0% house edge and an unlimited line of credit, it may take an extremely long time to earn even one dollar (if the table has a limit on bet sizes). – Tanner Swett Jun 06 '20 at 06:31
  • 2
    Exactly! Otherwise, if the expected value of that stopping time was finite, you would have a winning strategy for a martingale (just stop when you have earned one dollar). – NiklasvMoers Jun 06 '20 at 06:33
5

Regions of a circle

One of my favorites:

Choose n points around the circumference of a circle, and join every point to every other with a line segment. Assuming that no three of the line segments concur, how many regions does this divide the circle into?

There's a rather obvious pattern, that breaks down at $n=6$.


Bayes' theorem

Related to the example given in the question is the following:

There is a rare disease that only $0.1\%$ of the population have. Suppose you have a test that can determine if someone has this rare disease at a $99\%$ rate of accuracy. If you test positive for the disease, what is the probability that you have the disease?

Seems pretty obvious, $99\%$ right?

Suppose there are $1,000,000$ people in the population. $999,000$ don't have the disease, which means $9,990$ people falsely test positive. $1,000$ people have the disease, and $990$ correctly test positive. So out of all of the people who have tested positive, $\frac{990}{990+9990} = \frac{1}{11}$ actually have the disease!


Simpson's Paradox

Imagine you have two bins: $A$ and $B$.

$A$ contains $5$ white balls, $6$ black balls

$B$ contains $3$ white balls, $4$ black balls

You want to pull a white ball, but you can only pull once from a bin of your choice at random. Which bin would you pull from? Clearly, $A$ gives you better odds.

Imagine you have two bins: $C$ and $D$.

$C$ contains $6$ white balls, $3$ black balls

$D$ contains $9$ white balls, $5$ black balls

Now which bin would you choose? Clearly, $C$ gives you better odds.

Let's combine bins $A$ and $C$, and combine bins $B$ and $D$. Would you pull from the bin with $A$ and $C$, or the bin with $B$ and $D$? Seeing as how $A$ and $C$ were both the better choices, their combination must still be the correct choice right?

$AC$ contains $11$ white balls, $9$ black balls

$BD$ contains $12$ white balls, $9$ black balls


Gabriel's Horn

There exists a shape that has infinite surface area but finite volume. The fact that such a shape can even exist may be pretty unintuitive to begin with.

Even more baffling is the idea that you can paint an infinite surface area in a finite amount of time and paint. Simply fill the horn with an amount of paint equal to its volume (which is finite), pour all of the paint out, and the entire interior of the shape has now been painted!


Partial differential equations

"In order to eat as much as possible in a day, one should not eat as much as possible all day."

Sounds confusing and unintuitive at first, right?

If someone wanted to maximize their food consumption, rather than continuously consume food the entire day, it may be optimal to consume 3 large meals or 5 small meals instead. This is understood through partial differential equations: the rate of digestion may be dependent on various other factors such as the amount of food in the stomach or appetite.


Function asymptotic growth

The wheat and chessboard problem is very famous, and is a go-to example educators use to demonstrate the monstrous unexpected growth of exponential functions.

But without the understanding of the growth rate of functions, some other results are also surprising:

There are more possible chess games than there are atoms in the observable universe.

There are approximately $10^{80}$ atoms but approximately $10^{120}$ possible chess games.

All of the digits of the number $$9^{9^{9^9}}$$ cannot possibly be contained within the observable universe.

How could four simple $9$'s create such a large number?

A little bit more esoteric would be examples like Kruskal's tree theorem and $TREE(3)$, or certain Diophantine equations like the positive solutions of $\frac{a}{b+c} + \frac{b}{a+c} + \frac{c}{a+b} = n$.


Infinite ordinals

Goodstein sequences, when evaluated naively, seem like they would not only grow extremely quickly, but grow forever.

However, a very basic understanding of infinite ordinals is enough to directly map the sequence to an ordinal sequence, making the fact that the sequence must eventually terminate to $0$ quite unsurprising and obvious.

Someone in the comments of the question mentioned the hydra game, which can be understood with the same idea.


There may be a couple more good examples from this thread: Examples of patterns that eventually fail

zhuli
  • 2,561
  • “All the digits of the number $9^{9^{9^9}}$ cannot possibly be contained within our universe.” What does that mean? And said who? – gen-ℤ ready to perish Jun 06 '20 at 04:56
  • 2
    If you were to write out the base-10 representation of the value of that number, there is no remotely feasible way to write it down so that it will fit in the observable universe, even under the most ideal of situations and extremely loose estimates. The number of digits of $n$ is approximately $\log_{10}n$. The number of atoms that can fit in the entire space of the observable universe is far lower than $10^{1000}$. – zhuli Jun 06 '20 at 05:16
  • @Zhuli: If you mean the observable universe, you should write “observable universe”. The universe cannot currently be ruled out to be infinite (and therefore having space for arbitrary many digits). The observable universe is finite, and more importantly, has relatively well known size. – celtschk Jun 06 '20 at 09:03
  • Not the first time I've gotten that criticism, fixed. – zhuli Jun 06 '20 at 09:06
5

The birthday paradox. Quote from Understanding the Birthday Paradox:

In a room of just 23 people there’s a 50-50 chance of at least two people having the same birthday. In a room of 75 there’s a 99.9% chance of at least two people matching.

Put down the calculator and pitchfork, I don’t speak heresy. The birthday paradox is strange, counter-intuitive, and completely true. It’s only a “paradox” because our brains can’t handle the compounding power of exponents. We expect probabilities to be linear and only consider the scenarios we’re involved in (both faulty assumptions, by the way).

  • 1
    Another probability related one: If you bought $1,000,000$ scratchcards, and each scratchcard had an independent probability of $1$ in $1,000,000$ of winning the grand prize, what is the probability of winning the grand prize at least once out of all of the scratchcards you bought? A lot of people assume that it should be close to $1$, and would say they would feel ripped off if they didn't end up winning with those odds. Replace the $1,000,000$ with $n$ and the limit approaches $1-\frac{1}{e}$ or about $63%$, and approaches it quite quickly. Even at $n=4$, the probability is $68%$. – zhuli Jun 06 '20 at 08:29
2

The wheat and chessboard problem is probably one of the simplest. Its is an old one too, going back at least eight hundred years or so.

If you put one grain of wheat or rice in the first square of a chessboard, two in the second, four in the third, and so on, doubling the number at each step, how many grains of wheat or rice do you have?

This is a very simple problem, one that only needs pen and paper (doubling and summing) to do; it's just that the immense scale of the result is completely unintuitive, $$N = \sum_{i=1}^{64} 2^{i-1} = 2^{64} - 1 = 18,446,744,073,709,551,615$$ which is on the order of a couple of hundred times the yearly production of wheat on the entire planet.

Guest
  • 141
2

In a far future, humanity has settled on Mars and Venus. Now Earth gets visits both from Martians and from Venusians, and some of them happen to be criminals.

In particular, of all the crimes done by visitors from other planets, $3/4$ are done by Martians, while only $1/4$ are done by Venusians.

Therefore a Martian visitor is three times as likely to be a criminal than a Venusian visitor, right?

Wrong! With the data given above it is impossible to say whether Martian or Venusian visitors are more likely to be criminals.

The conclusion would only be right if there happen to be as many Martian visitors as Venusian visitors, and moreover Martian and Venusian criminals do on average the same number of crimes on visit.

If there happen to be three times as many Martian visitors as Venusian visitors, both have the same crime rates. Should the number of Martian visitors be five times as high as the number of Venusian visitor, the same numbers would even mean that Martian visitors are doing less crimes on average than Venusian visitors. On the other hand, if Venusian visitors outnumber Martian visitors, it means that Martian visitors are even more criminal than those numbers suggest.

Also, if each Martian criminal on average tends to do three times as many crimes as a Venusian criminal, then even with equal numbers of visitors it means that the probability of a random Martian visitor being a criminal is the same as the probability of a random Venusian visitor being a criminal. In that case the Martian criminals do more crimes not because there are more of them, but because the criminal ones are more active.

This is unintuitive because most people don't even notice that they make hidden assumptions when making that wrong conclusion.

celtschk
  • 43,384
1

The "paradox" with the name "Achilles and the Turtle" is very confusing to a lot of people. It goes as follows:

Archimedis and a turtle have a race. The Achilles moves 10x faster than the Turtle, the ladder gets a head Start of 100 meters though. The race begins, Achilles quickly runs the 100m - the turtle however moves 10m in this time, so it is still 10m ahead. Achilles tries to run the 10m to the turtle, but in that time the turtle again crawls foreward 1m - so it is still 1m ahead. This continues forever. In the end - can the faster Achilles ever overtake the turtle?

This is confusing since it seems like Achilles is getting arbitarily close to the turtle but can't ever pass it, although he should be able to.

A relevant factor, namely time, is not mentioned. The time spans we consider are getting smaller by a factor of 10 every iteration, so if we sum the observed time up we obtain a geometric series with q= 1/10. This implies that the total time we can witness with this kind of observation is finite, since the geometric series converges. This is the reason that in this perspective Achilles can't ever pass the turtle - he is never given quite enough time! With linear time however, he would simply pass the turtle at the 111.11111... meter mark.

This unintuitive, because in our world time goss by linearly, so it seems logical to assume this must hold for this problem as well - a fallacy! Also, some people have a misleading intuition of mathematical series because they don't consider that infinitely many things can add up to something finite.