0

I am curious what justifies the use of shorthands or abbreviations for certain formulas in first order logic. In particular, I'm interested in building up some basic mathematical principles from the zfc axioms and there's many times when a new symbol is introduced which expresses some logical statement more succinctly. My question is how this is logically or syntactically justified. For example, in the Wikipedia article for Zermelo Fraenkel set theory under the Axiom of Extensionality it is stated that $x=y$ can be taken to be shorthand for

$$ \forall z[ z \in x \Leftrightarrow z \in y] \land \forall w [ x \in w \Leftrightarrow y \in w] \hspace 1in (1) $$

The way I interpret that is anywhere we see the above expression we can delete it and "plug in" $x=y$.

My questions are as follows.

First, is there a way to notationally indicate that the expression $x=y$ and the expression in $(1)$ are related in such a way (logically equivalent?) that one can (without any intervening steps?) simply replace one by the other in any expression?

Second, which rules of logic allows us to create definitions like $x=y$ and which rules allow us to make the replacements indicated above?

As a slightly simpler example or a hint at what I feel might be the answer, suppose we have the two axioms

$$ B \Leftrightarrow C $$ $$ A \land B $$

Now in a few lines we could derive $A \land C$. However, it seems to me that if we have ANY formula involving $C$ we could in some number of lines derive the same expression but with $B$ replaced by $C$. This would especially be useful in very complicated expressions where $B$ might be embedded pretty deeply in the expression and it would be hard to get at $B$ to replace it with $C$ using the usual rules of deduction. Is there something that allows us to replace all instances of $B$ with $C$ if we have an expression like $B \Leftrightarrow C$.

I would appreciate someone pointing me towards the correct concepts I should be looking up and some proper notation for the concepts I am trying to express!

Also note: I don't really know the meaning of things like $\vdash$, $\vDash$, or the difference between $\leftrightarrow$, $\Leftrightarrow$, $=$, $\equiv$, which are sometimes brought up as having different meanings. Maybe these different meanings are key to my question.

Jagerber48
  • 1,431

1 Answers1

1

As Mauro ALLEGRANZA said, the most common way to deal with this problem in the context of first-order logic and similar systems is to handle it entirely metalogically.

For example, if we are working in a first-order logic without equality in which we've formalized the ZFC axioms say, then we can define a new binary relation symbol which we'll write as $=$ and add a new axiom that states $$\forall X,Y.X = Y \iff (\forall x. x \in X \iff x \in Y)\land(\forall Z.X\in Z \iff Y \in Z)$$

This produces a new first-order theory, call it ZFC', that has in addition to this relation symbol and axiom, all the function, relation symbols, and axioms of ZFC. As Mauro ALLEGRANZA states, this is called extension by definition.

You can additionally prove, metalogically by induction over all formulas, that for every formula $\varphi$, if $x = y$ then $\varphi(x)\iff\varphi(y)$. This is called indiscernibility of identicals. This shows that it would be admissible to add as a rule of logic a rule that, in a single step, let you replace $x$ with $y$ in any formula if you had proven $x = y$. Indeed, first-order logic with equality has such a rule for the built-in notion of equality, and if you formulated ZFC in this logic you'd likely have the above axiom as well, only in this context it wouldn't be a definition but instead a quite significant statement that could lead to inconsistency if you made a mistake.

Indiscernibility of identicals gives you the "one-step" proof rule you desire as far as introducing new terms, i.e. new function symbols and constants. Often proof systems don't have an equivalent for logical equivalence. This is usually because there is no way in the object language to talk about formulas parameterized by propositional variables. (In higher-order logics, we can, though in some higher-order logics there isn't a distinction between predicates and terms, so the indiscernibility of identicals handles both cases.) We can prove, metalogically, that logical equivalence is a congruence with respect to the logical connectives and this warrants replacing subformulas with their logical equivalents. This operation is extremely common when informally reasoning about both formal and informal statements.

A notion of definition in the object language is more common in programming languages and the languages used by proof assistants which are often based on some form of type theory. (For many proof assistants, the interface is actually a programming language which then serves as the meta-language [that's what ML in SML stands for], and so the definition facilities of the host language are used to handle such abbreviations.) These languages are very often higher-order and support a notion of local definition which would be a bit awkward to deal with in a first-order language. An example is the let terms in the Calculus of Inductive Constructions which is the core language of Coq. Many type theories and programming languages are lambda calculi where a simple form of local definition can be defined via the (metalogical) abbreviation of $(\mathsf{let}\ x = t\ \mathsf{in}\ E) := ((\lambda x. E)t)$.

I vaguely recall approaches to formalize a more object language notion of definition in a more first-order logic context, but I don't have any good pointers into literature of that type. As I mentioned, the programming language theory and type theory literature discusses notions of definition and definitional equality quite a bit, but this is often in a programming and/or higher-order context.

  • This addresses the question for me! The link to extension by definition helped, however, your comment about the metalogical induction really got to the heart of what I was asking. The wikipedia article states that the conclusion holds but makes no reference to the metalogical proof. The induction proof makes sense given my second example. My intuition was that we should be able to build up the replacement rule for any formula using the fact that it clearly holds for simple formulas. – Jagerber48 Jan 10 '18 at 16:38