22

this might be a silly question, but I was wondering: PA cannot prove its consistency by the incompleteness theorems, but we can "step outside" and exhibit a model of it, namely $\mathbb{N}$, so we know that PA is consistent.

Why can't we do this with ZFC? I have seen things like "if $\kappa$ is [some large cardinal] then $V_{\kappa}$ models ZFC", but these stem from an "if".

Is this a case of us not having been able to do this yet, or is there a good reason why it is simply not possible?

Asaf Karagila
  • 393,674
K. 622
  • 923
  • 20
    Stepping outside PA is an "if". We prove PA is consistent if ZFC is. – Malice Vidrine Nov 02 '18 at 17:10
  • 1
    It is an "if", but its "if the axioms of ZFC are assumed, then PA is consistent" ... we don't need to assume that ZFC is consistent. (If ZFC is not consistent, then of course given ZFC we can prove that PA is consistent, or anything else, by the Principle of Explosion). – Jim Balter Jul 21 '21 at 05:22

4 Answers4

29

The problem is that, unlike the case for PA, essentially all accepted mathematical reasoning can be formalized in ZFC. Any proof of the consistency of ZFC must come from a system that is stronger (at least in some ways), so we must go outside ZFC-formalizable mathematics, which is most of mathmatics. This is just like how we go outside of PA-formalizable mathematics to prove the consistency of PA (say, working in ZFC), except that PA-formalizable mathematics is a much smaller and relatively uncontroversial subset of mathematics. Thus it is a common view that the consistency of PA is a settled fact whereas the consistency of ZFC is conjectural. (As I mentioned in a comment, as a gross oversimplification, “settled mathematical truth (amongst mainstream classical mathematicians)” roughly corresponds to “provable in ZFC”... Con PA is whereas Con ZFC is not.)

As for proving the consistency of ZFC, as you mention, we could go to a stronger system where there is an axiom giving the existence of inaccessible cardinals. Working in Morse-Kelley is another possibility. In any case, you are in the same position with the stronger system as you were with ZFC: you cannot use it to prove its own consistency, and since you’ve made stronger assumptions, you’re at a higher “risk” of inconsistency.

  • 5
    Thank you your answer, that makes perfect sense. As I mentioned on the other answer, I think the confusion came from reading "we know PA is consistent" in the wrong light. – K. 622 Nov 02 '18 at 17:21
  • 1
    @K. 622 That’s right. In a very crude (and possibly contentious) approximation “we know” $\approx$ “ZFC proves” – spaceisdarkgreen Nov 02 '18 at 17:28
  • Okay, that makes sense, thanks! Slightly related, if you don't mind me asking: when we say "$L$ (or $V$) is a model of ZFC", where are we stepping out to, to prove these statements? What is the "if" in that case? If there is a difference is this related to $L,V$ being classes and not sets? – K. 622 Nov 03 '18 at 01:34
  • @K.622 It is different for proper classes. "V is a model of ZFC" is just an assumption we're making: we're saying the axioms of ZFC holds in our universe of sets under consideration. "L is a model of ZFC" means every axiom of ZFC holds relativized to L. We can prove this in ZF axiom by axiom, but we can't formalize "for all ZFC axioms $\phi,$ $L\models \phi$" in ZF, since we can't formalize the satisfaction relation for proper classes. The meaning of the result is we've proved (in our metatheory) that Con(ZF) implies Con(ZFC) (and Con(ZFC+GCH), ZFC+V=L, and ZFC+ anything that holds in L). – spaceisdarkgreen Nov 03 '18 at 03:55
  • @K.622 (I should have said any statement ZF can prove holds relative to L). The reasoning behind this is that from any contradiction from ZFC + V=L, we can prove, in ZF, the relativizations to L of the axioms we need to make the contradiction, and then use the relativization of the proof of contradiction from there, generating a contradiction in ZF. From this angle, you can look at "V is a model of ZFC" as a completely trivial proof of the completely trivial fact that Con(ZFC) implies Con(ZFC). – spaceisdarkgreen Nov 03 '18 at 04:05
  • @K.622 (Note in the V case, we can't infer Con(ZF) implies Con(ZFC), since we can't prove in ZF that the axiom of choice holds in V (i.e. that it holds). We can only do that in ZFC where it's an axiom. Hopefully it's not too hard to convince yourself that if it were hypothetically possible (after all, the proof that it isn't very non-trivial), then Con(ZF) -> Con(ZFC) would be an obvious corollary.) – spaceisdarkgreen Nov 03 '18 at 04:19
  • This seems to make sense. If we use zfc, and then show some thing in it is formally consistent and is isomorphic to the system itself, then it just all but proved its own consistency and then the issues with that. – marshal craft Nov 03 '18 at 10:10
  • @marshalcraft Yep, exactly. On the other hand if a ZFC can prove a set is a model of some axioms, we have an absolute consistency proof in ZFC. We can do this for PA, ZFC-Inf, ZFC - Power set, or ZFC minus all but any finite number of reflection axioms. – spaceisdarkgreen Nov 03 '18 at 15:58
  • I’d also be remiss not to add that there’s little value in an absolute consistency proof in a stronger system in terms of actually believing in the consistency. If we believe the stronger system is sound, we believe it is consistent, and thus we already believe the weaker one is consistent, so we are being circular (as philosophers, not mathematicians). Thus the skeptical crowd is much more likely to be convinced by Gentzen’s proof of Con PA from PRA + $\epsilon_0$-induction or Godel’s dialectica proof than by the proof in a stronger system that $\mathbb N$ is a model. – spaceisdarkgreen Nov 03 '18 at 16:51
  • I think there some issues too, zfc makes formal the notion of sets to avoid Russell paradox and what not. It doesnt formally show things about axioms and consistency. I've read something to the extent that formal systems have issues with infinite sets and what not and fall way short of zfc. Additionally the author notes godel's consistency theorems presume a set theory not dissimilar to zfc in creating the formal logic just as a formal set theory presumes some ground logic in laying out the formal set theory. I will need to recall the text on set theory to reference it. – marshal craft Nov 05 '18 at 11:49
  • But there does seem to be a rough interplay between logic and sets. Similar to de Morgan's law and set theory. Some presume sets others logic. Also I believe godels theorems relate to enumerable sets of axioms not existing, and hence the idea of computer science and computation. It's all inter-relatedit seems regardless if enumerable sets or axioms. Zfc maybe a formal set theory and logic at some extent still intuitive, is probably related, then considering current problems such as goldbach conjecture and what not, continuum hypothesis, axiom of choice, they all exist in some abstract nature. – marshal craft Nov 05 '18 at 11:59
  • But yes ideally we would like everything spelled out in terms of formal language and symbols and operations. As was the motive before Godels no go theorem. So where do we go from there? I don't know that anybody knows as of now. But I've not seen formal system and operations lay out zfc, that alone. – marshal craft Nov 05 '18 at 12:06
  • Cause otherwise i think a turing machine in perhaps infinite time would be able to work put the theorems of zfc. What could be the issues with this? Undefinable reals? But we can see it is perhaps a multifaceted discipline solving this mathematics question. – marshal craft Nov 05 '18 at 12:21
  • Some arent even convinced it is a math question, but "computer science"? – marshal craft Nov 05 '18 at 12:24
14

Using ZFC plus the axiom "Uncountable strong inaccessible cardinals exist" to give a model of ZFC is exactly the same kind of "stepping outside" as when you use ZF to make a model of PA.

Arthur
  • 199,419
  • 2
    Ooh I see. I was essentially confused from reading "we know PA is consistent" elsewhere, when I should have interpreted that as "within the standard system we use today, PA is consistent". Thank you for clearing that up. – K. 622 Nov 02 '18 at 17:18
10

Of course we can. And we do just that.

To "step outside of $\sf PA$" means that you assume the consistency of a far stronger theory, e.g. $\sf ZFC$, which lets you construct $\Bbb N$ as an object and prove it satisfies $\sf PA$.

When you say "assume that $\kappa$ is a cardinal such that $V_\kappa$ is a model of $\sf ZFC$" you effectively saying "We are stepping outside of $\sf ZFC$ into a theory "$\sf ZFC+\varphi$ for a suitable axiom $\varphi$, and there we can can find a model of $\sf ZFC$.

In fact, when you assume something like $V_\kappa$ is a model of $\sf ZFC$, you can even find a fairly canonical model of $\sf ZFC$: $L_\alpha$ for the least $\alpha$ satisfying $\sf ZFC$, where $L_\alpha$ is the $\alpha$th step in the constructible hierarchy. As to why it exists, that's another question. But it is there.

Asaf Karagila
  • 393,674
4

You are right (in your comments) that "we know PA is consistent" is actually not because there is an absolute proof/justification of that claim. However, it is very misleading to think that PA's consistency is of the same kind of question as ZFC's consistency.

Firstly, numerous empirically verifiable statements about the natural numbers (via some encoding in physical medium) can be and have been proven using no more than PA, such as Fermat's little theorem, which implies the correctness of HTTPS encryption and decryption, which you are using right now to read this webpage. In this sense, there seems to be a rather concrete interpretation of PA in the real world, at least an approximate one (even if it fails beyond some huge size). So even if PA has no exact interpretation in the real world, we may never observe a failure of PA in the form of an explicit proof that $0=1$ (because there is no short proof of it). Thus in some sense consistency of PA can be justified empirically based on the real world.

On the other hand, ZFC does not enjoy similar ontological or empirical status, except for the fact that it can capture elegantly what mathematicians have been doing for centuries and no inconsistency has been found in ZFC (this is somewhat empirical). It seems there is no non-circular justification of ZFC as a whole. For example, if you use the cumulative hierarchy and take the powerset operation and iteration for granted, you still cannot justify full replacement (see here). If you treat the universe as open-ended, you may be able to obtain some replacement but then you would be unable to justify unbounded specification or unbounded replacement (because you cannot justify well-definedness of quantification over an unfinished universe).

Secondly, consistency is far from what mathematicians really want to have. ZFC+¬Con(ZFC) is consistent, and can prove everything that ZFC can, but everyone (rightly) rejects it, because it is unsound (and worse still proves itself inconsistent). Likewise, even if ZFC is consistent it may be unsound. Some logicians have doubts.

And there is another aspect to this soundness issue. There is no real hope of being able to define "soundness" itself non-circularly, unless we restrict to just "arithmetical soundness". The reason for this is, again, that the natural numbers suitably encoded appear to be the only concrete real-world collection of entities that we can make and test various predictions about. And we of course don't want to have a foundation for mathematics that proves false statements about them! If a first-order theory $S$ is consistent, $S$ does not prove any false $Π_1$ statement (arithmetical sentence with at most $1$ unbounded $\forall$-quantifier in Skolem normal form), but may still prove a false $Σ_1$ statement (with merely $1$ unbounded $\exists$-quantifier). (For example, ZFC+¬Con(ZFC) proves a false $Σ_1$ statement.) So once more we see that PA has a privileged role in the foundations of mathematics.

Thirdly, the fact that ZFC is elegant and virtually all mathematics can be expressed and proven in it does not imply that ZFC is 'more likely' to be 'correct' or even just arithmetically sound. This is because there are other incompatible foundational systems that can also express and prove virtually all mathematics. In other words, large parts of each powerful foundational system simply has no relevance to ordinary mathematics. Just for example, Dmitry Mirimanoff introduced well-foundedness and rank of sets, but never considered all sets to be well-founded, contrary to the regularity axiom in ZFC. And not only does mathematics outside of set theory never use regularity, it cannot be justified to be true of all collections (in any reasonable sense). Furthermore, logicians have investigated non-well-founded set theories as alternative foundations, such as Aczel's axiom and Quine's NF[U] set theory.

user21820
  • 57,693
  • 9
  • 98
  • 256
  • I remember seeing a FOM thread a while back where somebody actually argued that it was the most likely scenario that ZFC is consistent but not arithmetically sound. The argument was that ZFC is well-studied, etc., so it's probably consistent, but the completeness theorem can't give you a construction of a standard model, and why would we expect such a special model to just exist 'by chance'? (Edit: found it https://cs.nyu.edu/pipermail/fom/2009-May/013640.html ) – spaceisdarkgreen Nov 03 '18 at 17:23
  • @spaceisdarkgreen: Thanks for the link! I'll read it later. I personally also think that ZFC is consistent but I am a bit doubtful about its arithmetical soundness, though the only reason I have for my doubt is that I cannot find a non-circular justification of certain parts, and so it's unclear to me whether there is unsoundness buried deeply in it. – user21820 Nov 03 '18 at 17:27
  • @spaceisdarkgreen: I've read the FOM post and the related PDF. Indeed ZFC "makes sense only in terms of a vague belief in some sort of mystical universe of sets which is supposed to exist aphysically and atemporally (yet, in order to avoid the classical paradoxes, is somehow “not there all at once”)", unless we give up at least certain parts, such as unbounded specification and replacement, as I stated in my post, because it is less problematic if every set generated only quantifies over previously generated sets. – user21820 Nov 04 '18 at 04:21
  • @spaceisdarkgreen: And you're welcome to come to the logic chat-room. I think it would really be interesting to talk more in-depth about these foundational issues. =) – user21820 Nov 04 '18 at 04:37
  • @spaceisdarkgreen: I probably should say here that I disagree with some of Nik's conclusions in his related paper on mathematical conceptualism, such as "A further question that could be asked is whether it is possible in a conceptualistic system to use variables which range over proper classes such as P(N). I consider this question in [broken link] and reach the conclusion that this is indeed possible, but only if intuitionistic logic is used". There are other alternatives such as Kleene's $3$-valued logic, where set membership may not be boolean. [continued] – user21820 Nov 04 '18 at 06:56
  • [continued] In particular, see this post where I show how $3$-valued logic can be used not only to actually address Russell's paradox in a satisfying way, but also give a concrete description and explanation of how the powerset of naturals is predicatively unsafe. There are parallels between what I say there and what Nik says about being able to do real analysis without powerset of naturals, but we disagree about the philosophically best way to resolve the issue. =) – user21820 Nov 04 '18 at 07:00