Why is this true?
$\exists x\,\big(P(x) \Rightarrow \forall y\:P(y)\big)$
Why is this true?
$\exists x\,\big(P(x) \Rightarrow \forall y\:P(y)\big)$
Since this may be homework, I do not want to provide the full formal proof, but I will share the informal justification. Classical first-order logic typically makes the assumption of existential import (i.e., that the domain of discourse is non-empty). In classical logic, the principle of excluded middle holds, i.e., that for any $\phi$, either $\phi$ or $\lnot\phi$ holds. Since I first encountered this kind of sentence where $P(x)$ was interpreted as "$x$ is a bird," I will use that in the following argument. Finally, recall that a material conditional $\phi \to \psi$ is true if and only if either $\phi$ is false or $\psi$ is true.
By excluded middle, it is either true that everything is a bird, or that not everything is a bird. Let us consider these cases:
Since it holds in each of the exhaustive cases that there is something such that if it is a bird, then everything is a bird, we conclude that there is, in fact, something such that if it is a bird, then everything is a bird.
Since questions about the domain came up in the comments, it seems worthwhile to consider the three preconditions to this argument: existential import (the domain is non-empty); excluded middle ($\phi \lor \lnot\phi$); and the material conditional ($(\phi \to \psi) \equiv (\lnot\phi \lor \psi)$). Each of these can be changed in a way that can affect the argument. This might not be the place to examine how each of these affects the argument, but we can at least give pointers to resources about the alternatives.
Hint: The only way for $A\implies B$ to be false is for $A$ to be true and $B$ to be false.
I don't think this is actually true unless you know your domain isn't empty. If your domain is empty, then $\forall y: P(y)$ is true "vacuously," but $\exists x: Q$ is not true for any $Q$.
The examples with birds or drinkers were designed to make this simple logical truth sound paradoxical. Perhaps an example from ordinary mathematics will dispel the air of paradox. Consider a nonempty set $X$ and a function $f:X\rightarrow\{0,1\}$. Does the following proposition seem strange and unintuitive?
Theorem. There is an element $m\in X$ such that, if $f(m)=1$, then $f(x)=1$ for all $x\in X$.
Proof. The function $f$ has an absolute minimum. (The domain is nonempty and the range is a finite set of numbers.) Let $m$ be a point in $X$ where $f$ attains its minimum value. Clearly, if $f(m)=1$, then $f(x)=1$ for all $x$.
Really, all that logical formula is saying is that the truth value ($1$ for true, $0$ for false) of the "propositional function" $P(x)$ attains its minimum.
Here's a different approach: this can be fairly-straightforwardly proved from simple boolean algebra, starting from a tautology.$\quad$ $$ ((\forall y)P(y)) \lor (\lnot (\forall x) P(x)) \\ ((\forall y)P(y)) \lor ((\exists x)\lnot P(x))\\ (\text{anything} \Rightarrow (\forall y)P(y)) \lor ((\exists x)(P(x) \Rightarrow \text{anything})) \\ ((\forall x)P(x) \Rightarrow (\forall y)P(y)) \lor ((\exists x)(P(x) \Rightarrow (\forall y)P(y))) \\ ((\exists x)(P(x) \Rightarrow (\forall y)P(y))) \lor ((\exists x)(P(x) \Rightarrow (\forall y)P(y))) \\ (\exists x)(P(x) \Rightarrow (\forall y)P(y)) $$
In classical logic the following equivalence is logically valid: $$ \exists x (\varphi\Rightarrow\psi)\Longleftrightarrow(\forall x\varphi\Rightarrow\psi) $$ providing that $x$ is a variable not free in $\psi$. So the formula in question is logically equivalent to $\forall xP(x)\Rightarrow\forall yP(y)$.
Looking at the poblem from a slightly different perspective. Either (i) all objects in the domain of discourse have property $P$, i.e. $\forall y P(y)$ is true or (ii) there is $a$ in the domain for which $P$ fails, i.e. $\neg P(a)$ is true. In (i) $P(x)\Rightarrow\forall y P(y)$ must be true, so $\exists x(P(x)\Rightarrow\forall y P(y))$ is true. In (ii) $P(a)\Rightarrow\forall y P(y)$ must be true, therefore the sentence in question must be true as well.
In mathematics, if not in philosophy, the domain of quantification is usually made explicit in statements. If we let the domain be some set $U$, your statement becomes
$$\exists x\in U (P(x) \Rightarrow \forall y\in U (P(y))$$
or equivalently
$$\exists x(x\in U \wedge (P(x) \Rightarrow \forall y(y\in U\Rightarrow P(y))))$$ Then you can formally prove this statement is true iff $U$ is non-empty as sketched here:
For non-empty $U$, consider two cases:
$$\forall y\in U (P(y))\lor \neg\forall y\in U (P(y))$$
In each case, obtain
$$\exists x\in U (P(x) \Rightarrow \forall y\in U (P(y))$$ For empty U, assume to the contrary that
$$\exists x\in U (P(x) \Rightarrow \forall y\in U (P(y))$$
and obtain a contradiction.
Full text of formal proof at http://dcproof.com/Mats.htm
The location of the parentheses is crucial here. Note that $(\exists x)(P(x) \Rightarrow (\forall y) P(y))$ is very different from $(\exists x)(P(x)) \Rightarrow (\forall y) P(y))$. To see why the former statement is true, consider its negation which is
$$\forall x (P(x) \wedge \neg \forall P(x)) \equiv \forall x(P(x) \wedge \exists x \neg P(x))$$
which is clearly false if your domain is nonempty. In classical logic, we restrict to nonempty domains, so your original statement will always be true.
Translate it to english:
In any universe of things, there can always be found an instance which, if it has some property, then all other things have that property.
For any given property, you can find an instance in a set of things to make it true. (As long as that set is not the empty set, by the way; existential assertions are false for empty sets.)
If none of the things have the property, then you can choose any one of them. Call that chosen instance $a$. $P(a) \Rightarrow (\forall y) P(y)$ is vacuously true, because $P \Rightarrow Q$ is true whenever $P$ is false.
If all of the things have the property, then you can also choose any of them to be the instance $a$. $P(a) \Rightarrow (\forall y) P(y)$ is then true because $P(a)$ is true, and $(\forall y) P(y)$ is true.
If some things have the property but not others, then you just choose one which does not have the property to be the instance $a$. Then $P(a) \Rightarrow (\forall y) P(y)$ is, again, vacuously true on account of $P(a)$ being false.
But let's look at it by taking the contradiction and seeing where it leads: $\neg(\exists x)(P(x) \Rightarrow (\forall y) P(y))$
In any universe of things, there cannot be found an instance such that if it has some property, then all other things have that property.
we can move the negative inside: $(\forall x)\neg(P(x) \Rightarrow (\forall y) P(y))$
In any universe of things, for every one of its elements, it is false that if a given property holds true of the element, then it holds for all other elements.
Then we can integrate the $\neg$ into the formula by way of $\neg (Q \rightarrow P) = Q \land\neg P$ to arrive at: $(\forall x)(P(x) \land\neg (\forall y) P(y))$
But this asserts $P(x)$ for all $x$; i.e. that if we think of any property $P$, all things have that property! This is clearly absurd for any nonempty set; we can come up with all kinds of properties $P$ which holds only for some things in a set, or none at all.
To add a proof in Fitch style, natural deduction:
2 | |__________ ~Ex(Px -> VyPy) New Subproof for ~ Introduction
3 | | |________ Pa New Subproof for -> Introduction
4 | | | |_____b variable for Universal Introduction
5 | | | | |____ ~Pb New Subproof for ~ Introduction
6 | | | | | |__ Pb New Subproof for -> Introduction
7 | | | | | | _|_ 5,6 _|_ Introduction
8 | | | | | | VyPy 7 _|_ Elimination
. | | | | | <-------------------- end subpproof
9 | | | | | Pb -> VyPy 6-8 -> Introduction
10 | | | | | Ex(Px -> VyPy) 9 Existentional Introduction
11 | | | | | _|_ 2,10 _|_ Introduction
.. | | | | <---------------------- end subpproof
12 | | | | ~~Pb 5-11 ~ Introduction
13 | | | | Pb 12 ~~ Elimination
.. | | | <------------------------ end subpproof
14 | | | VyPy 4-13 Universal Introduction
.. | | <-------------------------- end subpproof
15 | | Pa -> VyPy 3-14 -> Introduction
16 | | Ex(Px -> VyPy) 15 Existentional Introduction
17 | | _|_ 2,16 _|_ Introduction
.. | <---------------------------- end subpproof
18 | ~~Ex(Px -> VyPy) 2-17 ~ Introduction
19 | Ex(Px -> VyPy) 18 ~~ Elimination
This is the shortest I managed, I don't think a shorter proof using the standard introduction and elimination rules exist
For explanation I compare this "paradox" with:
If the last person has left the room, all people have left the room.
Therefore
There is a person if he has left the room, all people have left the room.
The first statement is a truism , so how can the last statement not be one?
$\tag 1 \exists x \; [P(x) \Rightarrow \forall y \, P(y)]$
Claim: (1) is a tautology, since it leads to an equivalence chain that ends in a well-known tautology:
$\tag 2 \exists x \; [ \neg P(x) \, \text{ or } \, \forall y \, P(y)]$
$\tag 3 \exists x \; [ \neg P(x)] \, \text{ or } \, \forall y \, P(y)$
$\tag 4 \neg \, \forall x \, P(x) \, \text{ or } \, \forall y \, P(y)$
$\tag 5 \neg \, [\forall z \, P(z)] \, \text{ or } \, [\forall z \, P(z)]$
The equivalence chain ends with a tautology, so (1) must also be one.
We can go from (2) to (3) since the existential qualifier distributes over disjunction and $( \exists x) (\forall y) \, P(y) \equiv \forall y \, P(y) $.
In (5) the $[\;]$ brackets were put around identical expressions. It was not necessary to change both $x$ and $y$ to $z$ - they both were 'quantified away' and can be viewed as just logical placeholders.
Finally, this paradox can be viewed as a contortion of counterexample logic. The only useful tautology is (3); you can replace the $\text{or-disjunction}$ with $\text{XOR}$ (exclusive or).
Here is a sequent calculus style proof:
------------ (Id)
P(y) |- P(y)
--------------------- (Weak Right)
P(y) |- P(y), ∀yP(y)
--------------------- (⇒ Right)
|- P(y), P(y)⇒∀yP(y)
------------------------- (∃ Right)
|- P(y), ∃x(P(x)⇒∀yP(y))
--------------------------- (∀ Right)
|- ∀yP(y), ∃x(P(x)⇒∀yP(y))
------------------------------- (Weak Left)
P(z) |- ∀yP(y), ∃x(P(x)⇒∀yP(y))
-------------------------------- (⇒ Right)
|- P(z)⇒∀yP(y), ∃x(P(x)⇒∀yP(y))
------------------------------------- (∃ Right)
|- ∃x(P(x)⇒∀yP(y)), ∃x(P(x)⇒∀yP(y))
------------------------------------- (Contr Right)
|- ∃x(P(x)⇒∀yP(y))
It uses a multi-consequent sequent calculus, so possibly cannot be reduced to intuitionistic logic. Also the contraction structural rule shows that a substructural logic would possibly not do. These speculations could possibly substantiated by some non-classical logic model theoretic considerations.
Taking contrapositive makes the formula easier to parse:
\begin{align}\exists x\big(\,P(x) \to\forall y P(y)\,\big) \tag1\\\iff\exists x\big(\,\exists y\lnot P(y)\to\lnot P(x)\,\big).\end{align}
So, the given sentence $(1)$ is equivalent to:
putting $x:=y$ here clearly shows that this sentence is true.
Alternatively, consider the predicate $P(x),$ which is either atomic or compound.
Assign $s=1$ if $P(x)$ is a logical validity, and $s=0$ otherwise. (In other words, pick the smallest truth value that $P(x)$ attains as the interpretation varies, and denote it by $s.$ )
Putting $x:=s$ into the given sentence $(1)$ shows that it is true.
This drinker's theorem can be simply derived and explained as the implication rule during prenex normal form conversion in classic logic as referenced here:
The rules for removing quantifiers from the antecedent are (note the change of quantifiers):
($\forall$x $\phi$) $\rightarrow$ $\psi$ is equivalent to $\exists$x($\phi$ $\rightarrow$ $\psi$)
The above rule is essentially an easy result of material conditional replacement plus null quantification rule since there's no free occurrences of $x$ in $\psi$. Now we simply replace formula $\phi$ and $\psi$ as follow:
$\phi$=P(x), $\psi$ =$\forall$y P(y)
Then we can have an equivalent proposition (from right hand side to left from above referenced first order equivalence) to prove that is:
($\forall$x P(x)) $\rightarrow$ $\forall$y P(y)
Now it's pretty obvious the last one is a tautological truth. By the way, the reference also contains another "unnatural" implication rule during normal form conversion:
($\exists$x $\phi$) $\rightarrow$ $\psi$ is equivalent to $\forall$x($\phi$ $\rightarrow$ $\psi$)
After some time it's not that hard to realize there's nothing unusual or mystic about above 2 seemingly unnatural equivalences.
Suppose otherwise. Then
$$\lnot\exists(Px\to \forall yPy).\tag{1}$$
Then for any $a$ in the domain, we have
$$\lnot(Pa\to \forall yPy).\tag{2}$$
The only way this can be the case is if
$$Pa\tag{3}$$
while
$$\lnot\forall yPy.\tag{4}$$
But $(4)$ means there exists a $b$ in the domain such that
$$\lnot Pb.\tag{5}$$
But $(2)$ holds for all elements of the domain; in particular,
$$\lnot(Pb\to \forall yPy),\tag{6}$$
which, as above, gives both $Pb$ and $\lnot\forall yPy$; in particular:
$$Pb.\tag{7}$$
But then $(5)$ contradicts $(7)$.