12

I recently watched a YouTube video on Banach-Tarski theorem (or, paradox). In it, the presenter builds the proof of the theorem on the basis of a non-intuitve assertion that there as as many even numbers as there are whole numbers, which he 'proves' by showing a 1:1 mapping between the two sets.

But would that constitute a valid proof?

To me, the number-density (per unit length of the number-line) for whole numbers is clearly more than that for even numbers. And this, I'm sure, can also be trivially proved by mathematical induction.

Later, in the same video, it is shown how the interval [0,1] contains as many real numbers as there are in the real number line in its entirety. Once again, using the common-sense and intuitive concept of 'number-density', there would be clearly (infinitely) more real numbers in the entire number line than a puny little section of it.

It seems, the underlying mindset in all of this is: Just "because we cannot enumerate the reals in either set, we'll claim both sets to be equal in cardinality." In the earlier case of even and whole numbers, just "because both are infinite sets, we'll claim both sets to be equal in cardinality." And all this when modern mathematics accepts the concept of hierarchy among even infinites! (First proposed by Georg Cantor?)

Is there a good, semi-technical book on this subject that I can use to wrap my head around this theme, generally? I have only a pre-college level of knowledge of mathematics, with the rest all forgotten.

nxs
  • 123
  • 17
    One set having "as many" elements as another should not depend on how the elements are labelled. For instance, $\rm{one,two,three,\cdots}$ should be equinumerous to ${1,2,3,\cdots}$. Thus, the notion of one-to-one correspondence. The concept of density is distinct from the concept of cardinality. – anon Mar 29 '16 at 03:59
  • 4
    Google "Hilbert's Grand Hotel." that should get you started. – user317176 Mar 29 '16 at 04:00
  • @arctictern But it it merely labelling or also set-membership? The numbers 1, 3, 5, 7... , for example are present in the set of whole numbers but absent in the set of even numbers. – nxs Mar 29 '16 at 04:01
  • 12
    We define cardinality in terms of maps. That is what we mean. –  Mar 29 '16 at 04:02
  • 1
    @nxs your confusion stems from trying to think of cardinal equality in geometric terms. That simply can not be done since by its very definition cardinal equality is about abstract sets, devoid of any geometry. You got confused because you consider sets for which you have a concept of geometry for, and you forgot to neglect it since you are only concerned with cardinality. – Ittay Weiss Mar 29 '16 at 04:04
  • 4
    There are many different ways to talk about "how big a set is." For example, the set of real numbers from $[0,1]$ compared to the set of real numbers from $[0,2]$. In terms of cardinality, they are the same. In terms of Lebesgue measure, the second is of measure $2$ while the first was of measure $1$. In similar fashion, the cardinality of the sets in question might seem unintuitive (interestingly, the set of all rational numbers is countable, yet dense on the real number line), but moving away from the intuitive idea of "how many" from geometry, applying the definitions is needed. – JMoravitz Mar 29 '16 at 04:06
  • @IttayWeiss But how do we conceive or even prove abstract concepts? Abstract math, it seems, would be no different from throwing a set of dice to make some random rules (called definitions or axioms), and proceeding on the basis of such rules, with occasionally landing somewhere interesting. Even in the context of Banach-Tarski theorem, a sphere, its duplication etc are being talked talked about. These are clearly geometrical concepts and physical processes. – nxs Mar 29 '16 at 04:12
  • 1
    For reading, wikipedia is as fine as any to get started. Otherwise, principles of mathematical analysis by Rudin (the text is sometimes referred to as "baby rudin") will have information on cardinality in chapter two. – JMoravitz Mar 29 '16 at 04:12
  • While my journey may have just begun, I can't thank you all enough for sharing your insights and pointers. (My reputation is not high enough to upvote your comments.) – nxs Mar 29 '16 at 04:15
  • 6
    @nxs Abstract math is founded on axioms and definitions, which on the one hand seem arbitrary and artificial, but on the other hand are chosen because they are useful in describing the real world. – Plutoro Mar 29 '16 at 04:18
  • 1
    @AlexS Or because it's fun! Some days I feel like taking the axiom of choice, but Sundays are reserved for the axiom of determinacy. – YoTengoUnLCD Mar 29 '16 at 04:46
  • @YoTengoUnLCD On which day(s) do you assume the continuum hypothesis? – Plutoro Mar 29 '16 at 04:47
  • @AlexS Only when I want a wild Friday. I never assume the well ordering principle though, as that's obviously false. :-). – YoTengoUnLCD Mar 29 '16 at 04:48
  • 2
    @nxs responding to your comment to me, you seem to confuse abstract with arbitrary. There is nothing arbitrary in the choice of axioms for the things we study, no matter how abstract or concrete they are. – Ittay Weiss Mar 29 '16 at 05:05
  • 1
    The notion of "number-density" is definitely wrong for cardinality, but I wonder if it is something used elsewhere in mathematics. – ThomasW Mar 29 '16 at 06:48
  • 1
    I suggest Stories About Sets by Vilenkin. – DanielWainfleet Mar 29 '16 at 08:27
  • @ThomasW I suggest looking up measure theory, which is one area where concepts like density become relevant. – David Z Mar 29 '16 at 12:46
  • 2
    Even though your misconceptions have been corrected by the many other responses, I would like to point out that the "hierarchy among even infinites" that you attempted to use to castigate the idea, is in fact based on this idea. It was Georg Cantor who defined cardinality as equivalence between sets that can be put in 1-1 correspondence. And by means of that definition, he discovered that not all infinite sets are the same size ($\Bbb R$ cannot be put in 1-1 correspondence with the natural numbers). But he also saw other infinite sets could be (e.g., evens and all naturals). – Paul Sinclair Mar 29 '16 at 15:20

3 Answers3

23

This is a tough concept for most people learning about the sizes of infinite sets for the first time. Number density, as you call it, is an intuitive concept, and it makes more sense up front. But there are a couple of problems with it. It seems like you are wanting to define size of a set $A$ by the limit as $n$ goes to infinity of the number of numbers in $A$ less than $n$ divided by $n$. This, again seems natural. But what if my set were not made of numbers? What if it were a set of polygons? Or lines? A set of functions? What if it was a set of sets? There is an even bigger problem. The digits we write down when enumerating a set are really just symbols$^\ast$. The set $\{1,2,3,4,...\}$ is just a collection of symbols. If I change the symbol "1" to "2" and change "2" to "4", "3" to "6", and so on, I get the set $\{2,4,6,...\}$. I changed the way that each symbol looks. Have I really changed the size of the set? Is there a universal way to define the size of a set? There is. There is no confusion about the size of finite sets. It is also easy to see that if a function from a finite set $A$ to another finite set $B$ is one-to-one, and hits everything in $B$, then $A$ and $B$ have the same size. We simply extend the idea to infinite sets. This avoids the problem of having to have a number system pre-defined on the sets. It avoids the problem of relabeling the elements of the sets. And most importantly, it acknowledges that the size of a set is whatever we define it to be. So we choose a definition that is useful. This is a useful definition. In other words, your question "what constitutes a proof" is ill-posed. We do not prove that two sets are the same size if there is a bijective function between them, we define it that way.

As for the set $[0,1]$, it is not hard to find a bijection from $(0,1)$ to $\mathbb R$, so the definition says they are the same size. There is another theorem that says if we add a finite number of elements to an infinite set, then we do not change its cardinality. Thus, $\mathbb R$ and $[0,1]$ have the same cardinality.

As for books, I would suggest Mathematical Proofs, by Gary Chartrand.

$^\ast$Thanks to Todd Wilcox for the revised wording here.

Plutoro
  • 22,730
  • 3
  • 41
  • 71
  • Thanks, first off. Would you like to comment on my "chocolate" comment under @artic tern's response? It seems to me, there will be some cases where the symbol or the label names will matter because of the "identity" or the "nature" of the member elements involved. So, while cardinality may be useful to me as I want the resulting sphere to be as heavy as the original one, I also want it to remain the delicious chocolate and not become some other material (like plastic, glass, or wood). Sorry, but that is the best intuitive analogy I could come up with to express myself. Will checkout ur book. – nxs Mar 29 '16 at 04:51
  • 3
    Banach-Tarski is a nice exercise demonstrating a strange result, but it should never be mistaken for something we could actually do to a ball of chocolate, as it ignores the fact that matter is made up of atoms and molecules. Putting that aside, never fear! If some chocolate particles accidentally got relabeled wood particles, just find a bijection with another set of chocolate particles, and use this to switch out the bad ones. – Plutoro Mar 29 '16 at 04:57
  • 3
    "A number is just a symbol." Perhaps a number can just be a symbol, but normally in mathematics it is more than just a symbol. – ThomasW Mar 29 '16 at 06:51
  • 3
    A cautious +1 for "numbers are just symbols". A possible refinement of that statement might be "the groups of digits that we write down to represent numbers are just symbols". The numbers themselves have distinct properties regardless of representation (e.g., the object represented by "A" in base 16 has the same properties as it has when represented by "10" in base 10). With respect to cardinality, there is no difference between the objects we traditionally denote by "1" and "2", but in other areas, those two objects of course have different properties. – Todd Wilcox Mar 29 '16 at 12:37
  • 1
    @ToddWilcox You're right. I was perhaps a bit too flippant with my original statement (see my recent edit). Regardless, we could decide to change the properties of all the numbers in a set (move to a different ordering, topology, or algebraic structure) and not change the cardinality. – Plutoro Mar 29 '16 at 13:46
  • 3
    There is an intuition that if one set is a proper subset of another (that is, all members of A are members of B but there are members of B that are not in A), then the first set is "smaller" than the second. By that definition, though, there is no sense of which of two disjoint sets is "larger". – Michael Lorton Mar 29 '16 at 19:00
  • @Malvolio: Well the common definition of size based on bijections is heavily reliant on the axiom of choice. Without that, one can use either injections or surjections. But from a computational theory perspective these two are very very different. Injections correctly capture the size comparison, but surjections capture complexity rather than size. For example, Cantor's theorem that there is no surjection from a set to its powerset is true in a countable model of ZF only because that model does not know enough surjections (too complicated), but externally there is a reverse injection. – user21820 Mar 30 '16 at 03:43
  • @Malvolio: And internally it thinks there is no reverse injection but that's because it doesn't know enough about the powerset. (See http://math.stackexchange.com/a/1421921 for a proof that there is no injection in any model of ZF.) So it really depends on what you would like. I would say injections is the right concept because if you do know enough about powersets then injections correctly capture size, and matches with the intuition that a set is at least as large as any subset. Note that a set may not have a computable surjection onto a subset, whereas there is always a reverse injection. – user21820 Mar 30 '16 at 03:54
7

Once we determine that the notion of one-to-one correspondence defines a meaningful notion (one we refer to as "cardinality"), it then makes sense to determine the consequences. One of the consequences of this definition is that two sets can have equal size even if one is properly contained in the other. If you don't like it, then that's too bad. There are of course distinct but related concepts you can go learn about, like measure or density, but that doesn't change the fact that cardinality is a meaningful notion which captures an important intuition. If you feel a proof that one set has "as many" elements as another set is sketchy, then the issue is that the mathematician interprets "as many as" in terms of one-to-one correspondence whereas you don't.

Let's go back to the very idea of counting. The whole point is that different sets can be "the same" in a certain way even if they have very different-looking elements. The way to "forget" what the elements look like when determing if two sets have equal "size" is to speak of something that does not depend on how elements are labelled, in other words something that doesn't depend on relabelling elements, in other words something that prescribes two sets have the same size whenever there is a one-to-one correspondence between them. This is an important way to generalize the intuition of counting from finite sets to infinite sets.

If you want to think about it geometrically, imagine points in space. Whether two sets of points have "the same number" of points should not change if we move points around. Now consider points at integer coordinates $\dots,-2,-1,0,2,1,\dots$ on the number line. Then mentally zoom in by a factor of $2$ so that now the "same" set of points resides as double integer coordinates $\dots,-4,-2,0,2,4,\dots$. Zooming in shouldn't change the number of points! Now if you take all the points left of $0$, shift them to the right $1$, and then rotate them (say in an ambient 2D plane) over to the positive side of the number line, you end up with points at whole number coordinates $0,1,2,\dots$. This is how I mentally picture bijections between countable sets.

anon
  • 151,657
  • Thanks, first off! While I understand now that cardinality is a different notion than density, in the context of Banach-Tarski's sphere duplication, I think the labeling or the 'identity' of the set members does matter. For example, if you're duplication a sphere of chocolate using their process, you would want all infinitesimally small particles of chocolate to be in the resulting spheres and not particles of some other material like plastic or sand. (Ignore here the finite and limiting sizes of atoms). But if my intention is to duplicate a sphere of glass that I use as a paper-weight... – nxs Mar 29 '16 at 04:29
  • ... then, obviously I would care less if the identify of the constituent particles in the duplicated object is plastic or glass or wood. – nxs Mar 29 '16 at 04:30
  • 1
    @nxs Keep in mind that Banach-Tarski's theorem uses a lot more than just cardinality arguments (has almost nothing to do with it). It uses the concept of measure (which is another, much more complex way of assigning a "size" to a set). You should also avoid using ill defined terms (infinitesimally small particles of chocolate, what does that even mean?). – YoTengoUnLCD Mar 29 '16 at 04:42
  • @YoTengoUnLCD The word 'infinitesimally small' was used to allow the sphere duplication process... by leaving infinitely many set-members (of chocolate) at our disposal. I used 'chocolate' merely to convey the 'physical identity' requirement... that for some certain problems, all labels/symbols may not be equivalent. Would such a requirement fling this example out of the reaches of set theory? – nxs Mar 29 '16 at 05:04
  • @arctictern I found your first two paras beautfully articulated, thanks much. – nxs Mar 29 '16 at 09:48
  • 1
    @nxs: Chocolate is not made of points, but chocolate bits, and if you like fundamental particles.. Pure points and perfect lines exist only in mathematics and not in the real world. So there are as many points on the real line as there are between 0 and 1 on that same line, but that has no implication whatsoever for the real world. – user21820 Mar 29 '16 at 13:28
  • 1
    @nxs To go with your chocolate particle analogy, for every particle in either of the new spheres we can pinpoint the place on the original sphere that this particle came from. So if that point was chocolate before the transformation, since the only thing we have done is move it - it remains chocolate. (This assumes the infinite divisibility of chocolate as well as the axiom of choice) – Taemyr Mar 29 '16 at 16:43
  • @user21820 Well said! – nxs Mar 30 '16 at 03:10
  • @user21820 Though, as an afterthought, I wonder why the rigmarole of the process suggested in Banach-Tarski?! Why not instead simply say, "A sphere contains infinite points in it, so, instead of stopping at two, keep making as many spheres out it... since an infinity of something would never really run out"? – nxs Mar 30 '16 at 03:31
  • @Taemyr: Yea, for a while people thought that things were infinitely divisible. It is actually a reasonable idea at large scales, but if one thinks about it a little more carefully, one must conclude that there is some final underlying structure that is not divisible, by observing that there are physical laws governing everything, and certainly the laws aren't infinitely divisible. (Of course, easier realized with hindsight.) – user21820 Mar 30 '16 at 03:32
  • @nxs: Indeed by repeating Banach-Tarski $n$ times you can divide a 'solid' sphere into finitely many sets of points that when moved around rigidly can be formed into $n$ 'solid' spheres, where 'solid' is in the mathematical sense. However, you cannot get infinitely many 'solid' spheres because then you would need infinitely many pieces. Even if you allow countably infinitely many pieces, I'm not sure if it is possible, since it does not follow from the ability to make finitely many copies. – user21820 Mar 30 '16 at 03:35
  • 1
    Banach-Tarski has nothing to do with chocolate, and at the risk of condescension one should probably wait on trying to understand that theorem until after one more fully understands basic cardinal arithmetic. – anon Mar 30 '16 at 17:18
5

When we pass from finite to infinite sets, many aspects of our intuition break down and need to be updated. We define cardinality by the existence or not of a bijection. If there is a bijection between two sets they have the same cardinality. If not, the one that can be injected into the other is smaller. When you do this, all infinite subsets of the naturals have the same cardinality, as do the rationals. The reals are strictly greater-Cantor's diagonal proof shows that. We do not say that all sets greater than the naturals have the same cardinality. Cantor's diagonal proof can be used to show that the number of subsets of any set is greater than the number of elements of the set, so the number of sets of reals is greater than the number of reals. Then the sets of sets of reals are greater yet. It is a tower that goes on unimaginably far, but for most of mathematics we don't need very many of them. For a semi-technical introduction, I like Rudy Rucker, Infinity and the Mind.

Ross Millikan
  • 374,822