4

Recently, I became fascinated with Set Theory and I am willing to learn more. Although, there are some aspects that I would like to understand before doing it. A lot of questions concerning the foundations of Mathematics as been posted and I went to read some of them. My question is closely related to Set theoretic concepts in first order logic, to be more precise, in the answer given by Peter Smith.


Let’s (naively) pretend that someone not familiar with mathematical reasoning or sets wants to lear about set theory and we want to teach them. I’m trying to understand how this could be done in the most self contained way (since that person don’t have any strong mathematical background).

I think it would be really helpful to lear some logic first. So, I went to search and I pick up the book Logic and Structure by Dirk van Dalen. Although, there are a lot of mathematical terms that “appear” to come from set theory. But this can be solved: just aboard logic “verbally” (as in the link I mentioned).

I was trying to do this but I think this is quite “awkward” to say the least. For example, Dalen starts by presenting the alphabet of propositional logic and show how propositions can be constructed. In his definition, he uses the term smallest set

Definition $2.1.1$ The language of propositional logic has an alphabet consisting of

$(i)$ proposition symbols: $p_0, p_1, p_2, \dots$,

$(ii)$ connectives: $\wedge$, $\vee$, $\rightarrow$, $\neg$, $\leftrightarrow$, $\bot$,

$(iii)$ auxiliary symbols: $($, $)$.

Definition $2.1.2$ The set $\mathsf{PROP}$ of propositions is the smallest set $X$ with the properties

$(i)$ $p_i \in X$ ($i \in N$), $\bot \in X$,

$(ii)$ $\varphi, \psi \in X \implies (\varphi \wedge \psi), (\varphi \vee \psi), (\varphi \rightarrow \psi), (\varphi \leftrightarrow \psi) \in X$,

$(iii)$ $\varphi \in X \implies (\neg \varphi) \in X$.

Someone that has studied (at least naive) set theory will understand what this means, but can one create the same without using this ideas?

What I want to know is, to develop propositional calculus, how can one express the second definition without using the term smallest set?

Because this is quite a good idea to model the set of propositions. It allows us to prove the Induction Principle

Theorem $2.1.3$ (Induction Principle) Let $A$ be a property, then $A(\varphi)$ for all $\varphi \in \mathsf{PROP}$ if

$(i)$ $A(p_i)$, for all $i$, and $A(\bot$),

$(ii)$ $A(\varphi), A(\psi) \implies A((\varphi \square \psi))$,

$(iii)$ $A(\varphi) \implies A((\neg \varphi))$.

This principle is good, for example, to show that every proposition will have one and only one truth value. But how can I prove it without the previous notion of smallest set?

Is there even a way to handle propositional and first order logic without using set theory so then I can use it to work with set theory? If yes, how? (Or where can I find such an approach in the literature?)

Thank you for your attention!

Air Mike
  • 3,806
  • 3
    These issues also used to bother me a lot. I finally (mostly) got over it by accepting that in more advanced treatments (which I classify Dirk van Dalen's book as), basic set theory notions are allowed in the metalanguage. Of course, when it comes to dealing with axiomatic set theory itself from scratch, then everything is cast into the formal language and results proved in formal ways (such as what evidence is allowed, and not allowed, in a courtroom trial). FYI, the "intro. book" that worked best for me is Hamilton's Logic for Mathematicians. – Dave L. Renfro Aug 14 '21 at 18:27
  • @DaveL.Renfro first of all thank you for the comment and the suggestion. I was checking the book recommended and I think it is quite suitable for what I was thinking. Allow me just to say a couple of things: my purpose is to be as concise as possible. Then, that book would serve as a meta-theory to take axiomatic set theory? Once again, thank you for your help. – Air Mike Aug 14 '21 at 18:44
  • I know that we must start somewhere, we couldn’t do things from nothing. But in this way, after developing enough (axiomatic) set theory, then one could attack logic in a more formal way – Air Mike Aug 14 '21 at 18:46
  • 2
    We are somewhat turning in a circle: one cannot learn set theory without knowing a bit of logic; one cannot lern serious logic without a bit of set theory. If what you are really interested in is set theory, the best thing to do is to find an introductory book that has a preliminary chapter providing the amount of logic you will need to start set theory.For example : Cunningham, Set Thory A First Course . – Floridus Floridi Aug 14 '21 at 20:26
  • 2
    The amount of logic or set theory required here is miniscule if the definitions are phrased properly. $\mathsf{PROP}$ should be defined to comprise the sequences of symbols formed using an appropriate set of rules. All that is required to "bootstrap" logic and set theory is a modest amount of intuition about sequences of symbols: anyone who can speak a natural language has that intuition. – Rob Arthan Aug 14 '21 at 22:55
  • 1
    @FloridusFloridi thank you for your comment. That’s indeed what is bothering me - turning in a circle. My approach would be to define a formal language for set theory that $(1)$ would aloud to conclude when a formula is true (because of the Separation Axiom) and $(2)$ would aloud to conceive proofs (the inference rules, etc…). With this then I would be able to formalize logic and follow for more deep stuff like generalising formal languages (not only for set theory) and studying completeness and the like. How can I do this? – Air Mike Aug 15 '21 at 00:17
  • @RobArthan Thank you so much for your comment. But I have one further question. For example, the Separation Axiom allows to construct a subset of a given set by “collecting” the elements that satisfy some property, i.e., that turn that property into a true statement. This must be formalised, can we just rely on our intuition? – Air Mike Aug 15 '21 at 00:20
  • 2
    Already asked and discussed here: what is new in your question? – Mauro ALLEGRANZA Aug 15 '21 at 09:42
  • 1
    To get you started in formal logic, may I humbly suggested the tutorial included with my proof-checking freeware. It is designed to teach the basic methods of proof to undergraduates and advanced high-school students. It makes use of a simplified form of formal logic that is implicitly used in most math textbooks (not standard FOL). Free download at my homepage http://www.dcproof.com – Dan Christensen Aug 15 '21 at 11:59
  • @MauroALLEGRANZA the point is: what is the necessary logic for set theory (and by this I don’t mean the informal introduction in almost every sets theory book, I am looking for something more rigorous, maybe something that doesn’t use sets to avoid circularity). My question came from the question you mentioned, I linked it at the beginning – Air Mike Aug 19 '21 at 22:32
  • @DanChristensen I really appreciate your answer but that is too naive. I am looking for something formal.. – Air Mike Aug 19 '21 at 22:33
  • 1
    @AirMike There is nothing "informal" or "naive" about my system. I think you may find it useful as a starting point. It was designed to introduce students to the basic methods of mathematical proof, e.g. proofs by contradiction or by cases, and making generalizations. It may not be for you if you have already mastered them. – Dan Christensen Aug 20 '21 at 02:58
  • 1
    @AirMike - you have to start somewhere... You can develop set theory as a mathematical theory, using "natural logic". The same for formal logic: Aristotle developed it without set theory or other mathematical tools. With modern math logic we need a "moderate amount" of mathematics: we need at least numbers to apply induction to basic syntactical constructs or sets to use recursion. In conclusion, the issue is - as per linked post - you cannot deal the problem as an "egg or chicken" problem, because we cannot find an "absolute" foundation that does not assu.me some initial knowledge. – Mauro ALLEGRANZA Aug 20 '21 at 08:43
  • @DanChristensen when I said “informal” or “naive” I was actually saying that right now I am beyond that, i.e., I have already had contact with that material. I am trying to understand the develop of axiomatic set theory. It seems to me that it requires a lot of logic. But lot of logics tends to use lot of sets. And to me, that seems circular. So far, this has been bothering me and I am looking for some “peace of mind” about this issue – Air Mike Aug 20 '21 at 18:36
  • @MauroALLEGRANZA that’s what I was looking for. I agree that we need to start somewhere. May I ask where? For example, I can’t feel relaxed when I see the axiom of comprehension. Because of the concept of “formula”. But then, when I go to logic to understand this ideas, I see sets, functions, relations,… And this has been bothering me for a while, because it seems circular. I have read a lot of posts here about this issue, but the truth is that I never read any satisfying answer that would calm me. Do you have some suggestion to deal with this problem? – Air Mike Aug 20 '21 at 18:41
  • @AirMike - a general (debatable) attempt has been Bourbaki's one: a first level of formal logic (without meta-theory) and then axiomatized set theory and then the development of the theories of various mathematical structures. All "explained" with natural language, of course... – Mauro ALLEGRANZA Aug 21 '21 at 09:06
  • For a "moderate" approach to formalist mathematics, see Tourlakis' Lectures – Mauro ALLEGRANZA Aug 21 '21 at 09:07
  • @MauroALLEGRANZA I actually though of Bourbaki’s work. Because it is really nice attempt to do things from scratch, basing on natural language and “innate logic”. As far as I am aware he develop the necessary logic concepts to construct theories. I though that maybe it would be a nice starting point, because modern logic (as for example, in the book I am reading - “Logic and Structure”) can be done with that – Air Mike Aug 21 '21 at 12:42
  • @MauroALLEGRANZA Although, in the first chapter, there are several results (for example the criteria of substitution). These results are proved by using natural language and “common intuition”, right? – Air Mike Aug 21 '21 at 12:43
  • @MauroALLEGRANZA I looked for Tourlakis’ Lectures and I think that it is really nicely done. I have to say that I am really grateful for your suggestions. I think Bourbaki and Tourlakis may gave the answers and the peace of mind that I am looking for. Although, as first approach to this subject, which one you would highly recommend: Bourbaki or Tourlakis? – Air Mike Aug 21 '21 at 12:52
  • @MauroALLEGRANZA I’m really sorry for all my comments, but can you tell me how would interpret this: In Tourlakis, the concept of smallest set appears - then it is said that this means it is the least inclusive set. But what does this mean? (I know a little set theory and I know that the smallest set with some property would be the intersection of all sets with that property. But how can this concept must be interpreted without these notions but in such a way that would avoid ambiguities?) – Air Mike Aug 21 '21 at 12:58
  • Personally, I am not very concerned about the use of the words "set", "smallest set", etc. in the informal commentary that often accompanies the presentation of formal statements of the ZFC/FOL axioms or formal proofs. I think of such commentary as much the same as the natural language comments that are inserted into source code of computer programs. There are no rules governing such comments. They are ignored by the compiler. – Dan Christensen Aug 21 '21 at 16:02

4 Answers4

2

There are several options. One is to use Backus-Naur Form, which is a concise metalanguage to present context free grammars.The idea here is to view formulas as sequences of so-called terminal symbols, which are recursively generated via so called production rules. Every production rule defines a so called non-terminal symbol together with sequences of terminal and non-terminal symbols, which replace the non-terminal symbol in the generation.

For example the terms of a first-order language with constants and function symbols are generated via the BNF:

$t \space ::= \space x\space |\space c\space |\space ft_1 \ldots t_n$,

where $x$ is a variable of the language, $c$ a constant, $f$ an $n$-place function symbol and $t_i$ terms.

This BNF tells you that the non-terminal $t$ may be replaced by either a terminal $x$ or a terminal $c$ or any sequence consisting of a terminal $f$ followed by the non-terminals $t_1, \ldots, t_n$. In other words: A term is either a variable or a constant or a function symbol followed by an appropriate number of terms.

Analogously, the formulas for a language with a logical signature consisting of $\neg, \wedge, \forall$ is generated by the BNF:

$\varphi::= Rt_1 \ldots t_n \space | \space \neg \varphi \space \space | \space (\varphi \wedge \varphi') \space | \space \forall x \varphi \space$,

where $R$ is an $n$-place relation symbol, $t_i$ are terms and $x$ a variable of the language.

sequitur
  • 1,094
  • First, thank you so much for your answer I really appreciate it. Second, can we develop a formal language in this way? Can we define a semantic so we can understand when something is true? Can we build a deductive system? – Air Mike Aug 15 '21 at 00:27
  • 2
    BNF is just a way of defining the syntax of those formal languages, which can be represented as context free grammars. Rereading your original post, I understand it as asking for a very strong requirement on the metalanguage, namely that it is 'set-free'. This is NOT the case with BNF, since it presupposes the notion of formal grammars, which are certain sets.

    What could be of interest for you is plural logic: Logical systems which use primitive plurals instead of sets in the metalanguage.

    https://plato.stanford.edu/entries/plural-quant/

    Your

    – sequitur Aug 15 '21 at 16:07
  • Yes, yes, that’s really what I was looking for - something “set-free” that allows to develop rigorously the necessary logic to deal with axiomatic set theory. Also, about plural logic, that sounded interesting, can you recommend some bibliography on that? Thank you for your attention. – Air Mike Aug 19 '21 at 22:40
  • I recommend the bibliography in the SEP-article. I would highlight the articles by Boolos as well as the monographs by Oliver / Smiley and by McKay as well as the very recent 'The Many and the One' by Linnebo and Florio. In addition there's some discussion on the question whether plural logic is really set free. See the articles by Resnik, Linnebo (2003), Parsons's 'The Structuralist View of Mathematical Objects', Linnebo's 'Pluralities and Sets', Linnebo / Florio (2016) and Hossack's 'Sets and Plural Comprehension' . – sequitur Aug 20 '21 at 16:48
1

In your second textbook definition 2.1.2, "The set PROP of propositions is the smallest set X..." is only a meaningful proposition when semantically interpreted in its set-theoretic metalanguage. At the classic propositional logic level, we can only say it's either true or false per law of excluded middle, corresponding to whether this definition picks out such a really existent (abstract) math object called set or not satisfying all later properties. If you try to further prove it's really the smallest such set, then you have to employ some predicate from set-theoric or any alternative metalanguage including the semantic notion of order relation.

That being said, in the case of finite natural number we can express in FOL alone, e.g., that there're at least 3 such P objects albeit in a complicated fashion as follows: $$ ∃x ∃y ∃z (P(x) ∧ P(y) ∧ P(z) ∧ x \neq y∧ x \neq z ∧ y \neq z) $$

So generally to prove there're at least n objects in FOL we need to naturally deduce to a sentence with n existential quantifiers and (1+2+..+n) identity negations. But your PROP set example is obviously infinite in nature in terms of its members even though it may be a definite description, so FOL really cannot express by itself for smallest via above at least approach, you need to add order predicate relation from its metalanguage.

How? One approach at the set-theoretic metalanguage level is to first prove a lemma like below:

If PROP is the intersection of a collection of all sets each of which satisfies (i)–(iii) of definition 2.1.2, PROP must also satisfy (i)–(iii).

Then PROP must be the smallest such set since when you take the intersection of a bunch of sets, the result is always a subset of all of the original sets. So the smallest set predicate is now paraphrased and satisfied by the concept of subset and intersection of sets at the metalanguage level.

Finally the 2.1.2 PROP definition is very similar to inductive set-theoretic definition of natural numbers N or Peano axioms. The advantage to define PROP in this manner is to really pave the way for the future application of mathematical inductive proof regarding any general property of PROP, akin to prove general property of natural numbers.

cinch
  • 1,648
  • 1
  • 4
  • 12
  • I agree with you, and that’s my main goal. How can I describe the logic required for axiomatic set theory without using it itself? – Air Mike Aug 14 '21 at 19:47
  • What I mean is: when looking at the axiom of separation, clearly we need some logic. So we must introduce it in order to proceed. But how can this be done without circular reasoning? – Air Mike Aug 14 '21 at 19:49
  • 1
    Axiom schema of separation, like any other axiom in any deductive system, is introduced as unprovable premise to give further semantic power to proceed deduction like you already understood. But underlying logic (inference rules) doesn't get impacted at all. As Peter mentioned set theory is not necessary for math, you can create a naive plural theory treating math concepts as concrete objects, but you still need to add order as additional definitions or axioms in this naive plural theory. Other alternatives are modern type theories such as dependent type theory. – cinch Aug 14 '21 at 20:11
  • When I mention that axiom I was trying to give an exemple of my worries. That axiom “assumes” (maybe this is not the correct way to say this but I will say it) a language. Properties of objects and how can one decide if a condition is true for some object – Air Mike Aug 15 '21 at 00:23
  • How can one develop a naive formal language only to develop set theory? – Air Mike Aug 15 '21 at 00:24
  • @AirMike this is a much broader question like design a programing language which the instruction set can correctly obey command. Some favor expressiveness, others like Ocam's razor. We have many set theory languages (according to your terminology) such as ZF, NF, NBG, etc. Since set theory is meant a foundation for all of math, the existing language semantics may not be ideal for new areas such as algebraic geometry, that's why you have category theory, unimath stuffs. Halmos once wrote a book Naive Set Theory for beginners, as apposed to the more advanced ZF ones... – cinch Aug 15 '21 at 00:56
  • I think the comment of Floridus Floridi and my reply (below the question) will make more clear what I meant.. – Air Mike Aug 15 '21 at 01:25
  • You have to practice using FOL to express naive set theory, for example its axiom scheme of comprehension can be developed in formal first order language as: ∀z1 . . . ∀zn∃b∀x[x ∈ b ↔ P(x)]], where all variables except b range over everything—ordinary objects and sets, while b ranges over only sets. And then you can substitute P(x) using concrete formulas as schematic instances, and soon you'll find Russell's paradox logically... – cinch Aug 15 '21 at 02:11
  • I don’t understand why we still talk about naive set theory. That’s not what I meant… – Air Mike Aug 15 '21 at 02:25
  • @AirMike my understanding of your above question " ...and how can one decide if a condition is true for some object", this is precisely a logic question. Only logic ultimately deals with true or false, not set theory or any other fields including math, they just provide more semantics. The objective reality only acts in a hard-cold mechanical way, only conscious sane people possess the right judgement to decide if a well formed sentence is true or false. I just want to emphasize no circular reasoning here at all, FOL is the foundation of set theory... – cinch Aug 15 '21 at 02:44
  • True, but so far I have only encountered first order logic handled with set theory. Isn’t this circular? If not, then why? – Air Mike Aug 15 '21 at 02:48
  • 1
    @AirMike ah, that makes sense and explains your confusion. I highly suggest you try learning mathematical logic first, I found this free online course is a good one – cinch Aug 15 '21 at 02:53
  • Thank you so much for your suggestion! I went to that site and opened the file about Mathematical Logic. One question: it seems that natural numbers and induction are used. But that are constructed by set theory, right? – Air Mike Aug 15 '21 at 03:07
  • 1
    @AirMike not at all. It only used Peano Axioms (PA) to develop natural numbers with induction. By studying this you'll get more clear, so pls be rest assured there's no circular reasoning here. Hope this can completely settles all your doubt... – cinch Aug 15 '21 at 03:19
  • I am trying but I think it is really hard to look at foundations and not to think that are some circularity… And I don’t know how I can “handle” this – Air Mike Aug 19 '21 at 22:46
  • @AirMike after you learn and apply Lowenheim-Skolem theorem to ZFC (Skolem Paradox), you'll realize the formal first order language of set theory is not rich enough to accurately capture various concepts such as powerset which can mean something quite different from the intended notion intuited by the usual semi-formal logic implicitly employed by most people by default. Once you arrive here, I guess all you confusion will be settled. – cinch Aug 20 '21 at 06:20
1

This reference to sets in "the smallest set $X$ such that..." can easily be circumvented.

You can define a proposition as any assembage that appears in a sequence of assemblages $A_1, A_2, \dots, A_n$, in which each assembage is formed from the preceding ones in certain well-defined ways, as in Definition 2.1.2. Then whenever you need to use a proposition in your formal language, you can require that a suitable finite sequence of assemblages first be exhibited.

The problem is more substantial when you seek to prove propositions about assemblages, propositions, formal proofs, etc. In this case, you must decide whether the meta-theory you are working in is some form of set theory, which is easiest for purposes of exposition, or some form of arithmetic. Much of what is proved in logic about formal proofs, etc., could be formulated in fairly weak systems of arithmetic (where integers correspond via an effective bijection to assemblages). The main problem is that few people have the stomach for the details involved, and certainly, no one would want to inflict all of that on a beginner.

Edit: Your comments clarify what your concerns are. Before answering the questions, I think it would be clearest if I talked a little bit about the ways you can look at a system of logic as a foundation for mathematics.

In the most basic use of logic, you are merely setting up a system of symbols to formalize the statements and proofs of ordinary mathematical argument, say with set theory as a foundation. This is a purely syntactic level.

You have a certain intuitive conception of what the statements are meant to represent, and you need to check the following two things. First, you want to check that you consider the methods of deduction in your system to be valid, i.e. that the system is sound. That is, applied to true statements, they should result only in true statements. Second, you want to check that all the forms of argument ordinarily considered valid in mathematics have counterparts in the formal deductive system, i.e. that the system is complete.

At this purely syntactic stage, you have no formalized concept of the "meaning" of a statement. Therefore, the two points above cannot be proved in any rigorous sense. Soundness can be checked only by convincing yourself that, according to your intuitive sense of what the statements are supposed to "mean," the rules of deduction are acceptable. Completeness, at this stage, can be thought of as experimental in the sense that no one has observed a mathematical proof regarded (by humans) as valid that could not, with sufficient effort, be formalized in the system.

Of course, the forms of reasoning humans consider valid are based on our innate conceptions of logic.

Now let's think about your concern about a vicious circle. At this stage, you have a formal system of proof that you're satisfied with. There's nothing to stop you now from starting to use it to prove theorems of set theory. You don't really need to know any metatheorems to make use of it this way, so there can be no vicious circle.

In practice, you may need to prove a few metatheorems to convince yourself of the intuitive soundness and completeness of your system. Mostly, these will consist of various shortcuts that you can employ in formal proofs.

All of these basic metatheorems can be formalized in a fairly weak system of arithmetic (including some basic form of induction). But if your purpose is really to convince yourself of the two points I mentioned, whose content is intuitive anyway, this shouldn't matter, since all you need as you go along is to check that the metatheorems use no more than intuitively obvious facts about arbitrary finite sequences of characters written down on paper.

The next stage in developing logic is semantic, in the sense that you are concerned with giving a formal definition of the "meaning" of statements and proving metatheorems including, at minimum, formal equivalents of the two points above.

Let's think about what's involved at this stage. For the sake of simplicity, let's say that, instead of set theory, you're concerned with groups or, more generally, with structures consisting of a bunch of objects and a binary operation on them.

It won't have escaped your attention that the word "bunch" above is a cop-out to avoid using the word "set." There's really no way around the fact that a group is bound to be, in some sense at least, something like a set, as is the group operation, which is a "bunch" of triples.

Even set theory, ultimately, is about a "bunch" of objects called sets and a binary relation denoted $\in$ between them, itself a "bunch" of pairs of objects.

I think this makes it clear that the natural medium in which to formulate semantic questions is going to be akin to some version of set theory.

How is this done? Let's return to the example of groups. Now some logicians, for good reasons, like to consider a group to be a set $G$ together with a binary operation (representing the group operation), a unary operation (representing the taking of inverses) and a constant (representing the unit element). But to keep things simple, let's say that the language of groups consists instead of just a binary operation. (There are some technicalities here regarding the status of the binary relation $=$, but they are of no real consequence. For definiteness, we'll assume that the symbol $=$ is interpreted by the actual equality relation in $M$.)

Now you define a structure ${\cal M} = (M,\cdot)$ in the language of groups to be a set $M$ together with a binary operation $\cdot$ on $M$. (I'll just write $M$ for $\cal M$ below.) You define, by induction on the complexity of a formula $\varphi(v_1, \dots, v_n)$ with free variables $v_1, \dots, v_n$, the concept of such a structure $M$ satisfying $\varphi$ when $v_1, \dots, v_n$ are replaced with particular elements $x_1, \dots, x_n$ of $M$.

A statement is just a formula $\varphi$ with no free variables, and a theory is a set $T$ of statements. For example, $T$ could be the set of axioms for groups. A structure $M$ is a model of $T$ if it satisfies each statement in $T$.

At this point we can begin to think about how to formalize the two points from before. Let a theory $T$ and a statement $\varphi$ be given. Then we'll say that $\varphi$ is a syntactic consequence of $T$ if there there is a formal proof (according to your system) of $\varphi$ using only axioms in $T$ as assumptions. We'll say that $\varphi$ is a semantic consequence of $T$ if every model of $T$ also satisfies $\varphi$.

Now the soundness of your system of proof means that whenever a statement $\varphi$ is a syntactic consequence of a theory $T$, it is also a semantic consequence of $T$. The completeness of the system is the converse, namely that whenever $\varphi$ is a semantic consequence of $T$, it is also a syntactic consequence of $T$.

Gödel's completeness theorem states that certain well-known systems of proof, which are fairly easily shown to be sound, are also complete. In other words, if you choose one of these systems, then you have a good system of formal proofs that allows you to deduce a statement if and only if you ought to be able to deduce it.

Now if we switch from group theory to set theory, we'll say that a structure in the language of set theory is a set $M$ together with a binary relation $E$ on $M$, which is meant to represent set membership (but which need not be anything like the actual $\in$ relation between elements of $M$). If the pair $(M,E)$ satisfies, say, the axioms of ZFC, then we'll say it's a model of ZFC.

So here you have to face the reality that in the universe you're working in (at Level 2), you are using some version of set theory, and some of the formal objects (at Level 1) within your theory are models $(M,E)$ of some version $T$ of set theory (perhaps quite a different one), with $T$ also being a formal object of the theory. It can be a bit disconcerting to manipulate similar objects at Levels 1 and 2 simultaneously.

Now I'll try to answer your questions.

  1. I can use naive set theory as my meta theory to develop Mathematical logic, right? (In this way I would be able to use sets, functions, relations, and other “structures” because they come from it)

The answer is yes. But think about what you mean by "develop mathematical logic."

If what you mean is to set up the basic syntactic system in which you will prove mathematical statements, then you need, at most, some very obvious statements about finite sequences of characters on paper. While set theory is a convenient language in which to discuss this, it isn't necessary.

If what you want to do is more elaborate, for example to prove rigorously various facts about what can or can't be proved from particular sets of axioms, then the specific set of axioms of your metatheory may be very relevant. For example, if your metatheory is ZFC + (There exists a strongly inaccessible cardinal), then you can prove that the theory ZFC is consistent (by exhibiting a model of ZFC). But if your metatheory is ZFC, then either your metatheory is inconsistent or your metatheory cannot prove that the theory ZFC (a formal object within the metatheory) is consistent. (This is a consequence of Gödel's incompleteness theorem.)

  1. Back to sets: We already use some ideas from logic to work in naive set theory.. wouldn’t that be circular?

I think what you mean here is that, for example, we use the idea of conjunction in defining the intersection of two sets, etc. Please correct me if I'm wrong.

This is not a problem because such uses of conjunction ultimately amount to using the conjunction symbol in certain places in formal proofs. As long as you are obeying the rules of formal proofs, including the rules for manipulations involving the conjunction symbol, you are following the rules of the game.

Of course, what you are really doing is carrying out arguments which, psychologically speaking, are in the semantic realm, but which could be reduced with some work to formal syntactic proofs.

  1. I was intrigued by the ”arithmetic way”. Can you please give an example of how that can be done? Or some literature?

With respect to syntax, the arithmetization of formal proofs will necessarily be discussed in any source that presents a detailed proof of Gödel's incompleteness theorem. Two possible sources would be Mathematical Logic by Cori and Lascar, or Fundamentals of Mathematical Logic by Hinman. The theory of recursive functions is fundamental for this.

On the other hand, I'm just not familiar at all with the ways in which semantic logical concepts (most naturally developed within set theory) can be transposed into arithmetic. For example, I don't know how complex a system of arithmetic you would need in order to formulate and prove meaningful versions of Gödel's completeness theorem. In any case, I anticipate that this area must be highly technical. If all you care about is soundness, I suspect that you can get by with less. Once you've learned enough logic (using set theory as your metatheory), you could ask a new question specifically about this.

The bottom line, then, is that there is no vicious circle if all you want to do is use your system of formal proofs to write down mathematics. However you must take a certain amount of mathematics (a subset of ZF) on faith if you wish to prove that your system is sound and complete.

Anonymous
  • 1,010
  • Thank you so much for that answer. I have a couple of questions: $1)$ I can use naive set theory as my meta theory to develop Mathematical logic, right? (In this way I would be able to use sets, functions, relations, and other “structures” because they come from it) $2)$ Back to sets: We already use some ideas from logic to work in naive set theory.. wouldn’t that be circular? $3)$ I was intrigued by the ”arithmetic way”. Can you please give an example of how that can be done? Or some literature? Once again, thank you for your attention. – Air Mike Aug 19 '21 at 22:26
  • Also, I would like to say that you’re answer really goes in the direction of my question, i.e., I want a solid basis to develop some logic, so that I can develop set theory (and therefore the basis for logic must not include sets, otherwise it would be circular) – Air Mike Aug 19 '21 at 22:28
  • You may be interested in my answer for elaboration of how much you need for basic meta-theorems for FOL. =) – user21820 Jun 07 '22 at 23:35
0

I think your main issue is that you do not realize that your version of PROP is impredicative, and such kind of definitions would arguably require significant set-theoretic commitment. After all, you are asking for the smallest set that is closed under some rules, so on its face that requires you to be able to contemplate all such sets and somehow obtain their intersection from them as a whole.

But that is not the way to do things, if you want to minimize philosophical assumptions! Instead of impredicative definitions, you need predicative definitions. PROP can be easily defined as follows. By recursion, define $F∈ℕ→Str$ such that $F(0)$ is the set of propositional atoms and for each $k∈ℕ$ we have that $F(k+1)$ is the set of formulae that are in $F(k)$ or can be obtained by joining some members of $F(k)$ by one propositional connective. Then let $U$ be the union of $F(k)$ over all $k∈ℕ$. $U$ is the set you want!

If you work within a set theory, you generally would have to construct $U$ via some comprehension axiom, and you can even do so in a single step without any recursion! Concretely:

Let $A$ be the set of propositional atoms.

Define $J(S) = \{ \ x : x{∈}S ∨ ∃y,z{∈}S\ ( \ x = y+z \ ) \ \}$ for any set $S$.

Let $U = \{ \ x : x{∈}Str ∧ ∃k{∈}ℕ\ ∃F{∈}ℕ_{≤k}{→}Str\ ( \ \\\quad F(0) = A ∧ ∀i{∈}ℕ_{<k}\ ( \ F(i{+}1) = J(F(i)) \ ) ∧ x{∈}F(k) \ ) \ \}$
.

Regarding how much set theory you actually need to get all the important theorems of logic, it is actually very little! See this post on building blocks. Note that encoding into subsystems of SOA (second-order arithmetic) require Godel coding if you want to really minimize assumptions, and with that ACA0 can prove the incompleteness theorems and the semantic completeness theorem for countable FOL theories, and but if you do not like Godel coding you can construct a similar system based on TC.

In particular, just as ACA0 is essentially obtained from PA by reifying definitions over PA (as sets), we can define TC2 as TC for strings plus a suitable form of induction plus comprehension axioms that allow you to construct any set of the form $\{ \ x : Q(x) \ \}$ where $Q$ is a property over TC. Due to the inherent nature of strings, we will have a much easier time encoding finite sequences of strings as a single string. See here under "Encoding program execution in a string".

It is also possible to argue that ACA0 or TC2 do not actually involve any set-theoretic assumptions, since at no point do we truly treat the 'sets' as objects. Instead, you can think of each 'set' as simply its defining property, which is just a string! In fact, each of them has a countable intended model matching this view! (The 'sets' in this model are exactly the sets that can each be computed using some finite Turing jump.)

Note that we do not appear to have any uncountable collection in real life, so it is not a big deal that ACA0/TC2 has a countable intended model, nor that they can only prove theorems about countable FOL theories (which include themselves). In short, ACA0/TC2 arguably can "handle propositional and first order logic without using set theory".

If you want to tackle higher logic, such as involving uncountable theories or uncountable models, you will definitely have to have something quite like a full-blown set theory. But the point is that you do not really need set theory for basic results about your chosen foundational FOL theory.

user21820
  • 57,693
  • 9
  • 98
  • 256