We make the simplifying assumptions:
- Equal chance of boy or girl.
- Victoria is a girl's name and is never given to boys.
Let's ask the generic question:
- Mr Smith has two children, whom I've arbitrarily labeled Child A and Child B. Child A is girl with property Q or Child B is a girl with property Q. What is the chance that Child A is a girl and child B is a girl.
Let's rephrase the question to be more amenable to Bayesian Analysis.
- A priori, we know that Mr Smith has two children, whom I've arbitrarily labeled A and B. Given that (Child A is a girl and Child A has property Q) or (Child B is a girl and Child B has property Q), what is the chance that Child A is a girl and child B is a girl.
Let
- $A_G$ be the statement Child A is a girl
- $B_G$ be the statement Child B is a girl
- $A_Q$ be the statement Child A has property Q
- $B_Q$ be the statement Child B has property Q
We ask for
\begin{align}
P([A_G \text{ and } B_G] \text{ given } [(A_G \text{ and } A_Q) \text{ or } (B_G \text{ and } B_Q )])
\end{align}
Now, $P (A \text{ given } B) = \frac{P(A \text{ and } B)}{P(B)}, so our expression above is equal to
\begin{align}
&\quad\frac{P([A_G \text{ and } B_G] \text{ and } [(A_G \text{ and } A_Q) \text{ or } (B_G \text{ and } B_Q )])}{P((A_G \text{ and } A_Q) \text{ or } (B_G \text{ and } B_Q ))}\\
& = \frac{P([A_G \text{ and } B_G \text { and } A_Q] \text{ or } [A_G \text{ and } B_G \text { and } B_Q])}{P((A_G \text{ and } A_Q) \text{ or } (B_G \text{ and } B_Q ))}
\end{align}
For now, let us assume that Q implies you are a girl. So $A_Q \implies A_G$, and $B_Q \implies B_G$. This means $P(A_Q \text{ and } A_G) = P(A_Q)$.
We can then simplify the above statement to:
\begin{align}
\frac{P([B_G \text { and } A_Q] \text{ or } [A_G \text { and } B_Q])}{P(A_Q \text{ or } B_Q)}
\end{align}
Now, $P(A \text{ or } B) = P(A \text{ and }\neg B) + P (\neg A \text{ and } B) + P(B \text {and} A)$. ($\neg$ means not).
So, expanding the top and the bottom, our formula becomes,
\begin{align}
\frac{P([B_G \text { and } A_Q] \text{ and } \neg [A_G \text { and } B_Q]) + P(\neg [B_G \text { and } A_Q] \text{ and } [A_G \text { and } B_Q]) + P([B_G \text { and } A_Q] \text{ and } [A_G \text { and } B_Q]) }{P(A_Q \text{ and } \neg B_Q) + P(\neg A_Q \text{ or } B_Q) + P(A_Q \text{ and } B_Q) }
\end{align}
Now our equation is a mess. But we did this so we can apply some symmetry arguments, and because it will be easier to understand. Let's simplify the top a bit first though.
We note that
\begin{align}
&\quad[B_G \text { and } A_Q] \text{ and } \neg [A_G \text { and } B_Q]\\
& = [B_G \text { and } A_Q] \text{ and } [\neg A_G \text { or } \neg B_Q]\\
& = B_G \text { and } A_Q \text { and } \neg B_Q &\text{since $A_Q \implies A_G$}
\end{align}
Our formula then simplifies to
\begin{align}
\frac{P(B_G \text { and } A_Q \text{ and } \neg B_Q) + P( A_G \text { and } B_Q \text { and }\neg A_Q) + P([B_G \text { and } A_Q] \text{ and } [A_G \text { and } B_Q]) }{P(A_Q \text{ and } \neg B_Q) + P(\neg A_Q \text{ or } B_Q) + P(A_Q \text{ and } B_Q) }
\end{align}
We now apply the symmetry argument, noting that Child A and Child B are arbitrary labels, and as such
\begin{align}
P(B_G \text { and } A_Q \text{ and } \neg B_Q) &= P( A_G \text { and } B_Q \text { and }\neg A_Q)\\
P(A_Q \text{ and } \neg B_Q) &= P(\neg A_Q \text{ or } B_Q)
\end{align}
Our formula simplifies to
\begin{align}
&\quad\frac{2 P(B_G \text { and } A_Q \text{ and } \neg B_Q) + P([B_G \text { and } A_Q] \text{ and } [A_G \text { and } B_Q]) }{2 P(A_Q \text{ and } \neg B_Q) + P(A_Q \text{ and } B_Q) }\\
&=\frac{2 P( [A_Q \text{ and } B_G] \text { and } \neg B_Q ) + P(A_Q \text { and } B_Q ) }{2 P(A_Q \text{ and } \neg B_Q) + P(A_Q \text{ and } B_Q) }\\
\end{align}
Now, we at last note that $P (A \text{ and } \neg B) = P(A) - P (A \text{ and } B)$ to rewrite the formula as:
\begin{align}
&\quad\frac{2 P( [A_Q \text{ and } B_G] \text { and } \neg B_Q ) - 2 P(A_Q \text{ and } B_G \text{ and } B_Q) + P(A_Q \text { and } B_Q ) }{2 P(A_Q \text{ and } \neg B_Q) + P(A_Q \text{ and } B_Q) }\\
&=\frac{2 P( A_Q \text{ and } B_G) - 2 P(A_Q \text{ and } B_Q) + P(A_Q \text { and } B_Q ) }{2 P(A_Q \text{ and } \neg B_Q) + P(A_Q \text{ and } B_Q) }\\
& = \frac{2 P( A_Q \text{ and } B_G ) - P(A_Q \text{ and } B_Q) }{2 P(A_Q \text{ and } \neg B_Q) + P(A_Q \text{ and } B_Q) }
\end{align}
In almost all cases, $A_Q$ and $B_G$ are independent. That is, a priori, $A$ having the property $Q$ does not affect $B$'s sex. One of the only case where $A_Q$ affects $B_G$ is if $A_Q$ states that A is the only girl. However, we cannot assume that $A_Q$ is independent of $B_Q$. For example, $A$ being the older child means that B cannot be the older child.
We then write our formula as
\begin{align}
&\quad\frac{2 P (A_Q)P( B_G ) - P(A_Q \text{ and } B_Q) }{2 P(A_Q \text{ and } \neg B_Q) + P(A_Q \text{ and } B_Q) }\\
&=\frac{2 P (A_Q)P( B_G ) - P(A_Q \text{ and } B_Q) }{2 P(A_Q \neg B_Q) - 2P (A_Q \text{ and } B_Q) + P(A_Q \text{ and } B_Q) }\\
& = \frac{2P (A_Q)P( B_G ) - P(A_Q \text{ and } B_Q) }{2 P(A_Q ) - P (A_Q \text{ and } B_Q) }
\end{align}
Finally, we note that $P(B_Q \text{ and } A_Q) = P(B_Q \text{ given } A_Q)P(A_Q)$
We finally have
\begin{align}
&\quad \frac{2P (A_Q)P( B_G ) - P(B_Q \text{ given } A_Q) P(A_Q) }{2 P(A_Q ) - P (B_Q \text{ given } A_Q) P(A_Q)}\\
& = \frac{2P( B_G ) - P(B_Q \text{ given } A_Q) }{2 - P (B_Q \text{ given } A_Q) }
\end{align}
Finally, We have, given the following assumptions:
- Having the property Q implies being a girl. We can handle the case of Q not implying being a girl as well. Shown below.
- Child A having property Q does not affect Child B's being a boy or a girl.
\begin{align}
\frac{P( B_G ) - \frac{1}{2} P(B_Q \text{ given } A_Q) }{1 - \frac{1}{2} P (B_Q \text{ given } A_Q) }
\end{align}
or if we set $P(B_G) = 0.5$
\begin{align}
\frac{1 - P(B_Q \text{ given } A_Q) }{2 - P (B_Q \text{ given } A_Q) }
\end{align}
We have done this to show that the probability ultimately depends on how likely it is that Child B has property Q given that Child A has property Q, a priori.
Let us look at two extreme cases. We'll also use these cases to demonstrate how to handle the case when Q does imply being a girl.
A) $B_Q$ cannot occur if $A_Q$ is true.
Mr Smith has two children, whom I've arbitrarily labeled Child A and Child B. Child A is girl and is oldest or Child B is a girl and is oldest. What is the chance that Child A is a girl and child B is a girl?
Let $Q$ be the property of being the oldest child. Q does imply being a girl. However, we can simply change $Q$ to be the property of "being a girl, and being oldest". This does not change our question at all.
Since only one child can be the oldest girl, $P(B_Q \text{ given } A_Q) = 0$. And our formula simplifies to $P(B_G) = 0.5$.
B) $B_Q$ and $A_Q$ are empty statements. Something that tells us nothing about A and B. For example
- Mr Smith has two children, whom I've arbitrarily labeled Child A and Child B. Child A is girl (and is human) or Child B is a girl (and is human). What is the chance that Child A is a girl and child B is a girl?
Once again, an empty statement (or being human) does not imply being a girl. So Q must therefore be the statement "is a girl (and is human)". That means Q=G.
\begin{align}
&\quad\frac{1 - P(B_Q \text{ given } A_Q) }{2 - P (B_Q \text{ given } A_Q) }\\
&=\frac{1 - P(B_G \text{ given } A_G) }{2 - P (B_G \text{ given } A_G)}\\
& = \frac{1 - P(B_G ) }{2 - P (B_G )} &\text{since $B_G$ and $A_G$ are independent}\\
& = \frac{1}{3}
\end{align}
In summary, the more unlikely $P(B_Q \text{ given } A_Q)$ is, the more the probability tends toward $\frac{1}{2}$. We should think of $P(B_Q \text{ given } A_Q)$ as how unique the property $Q$ is. If Q is super unique, then we can identify the child with property Q, and the probability tends toward $\frac{1}{2}$. If Q is vague, and it is very possible that both child has property Q, then we cannot use property Q to identify the child, and the probability tends toward $\frac{1}{3}$.
To answer the question
Mr Smith has two children, whom I've arbitrarily labeled Child A and Child B. Child A is a girl named Victoria or Child B is a girl named Victoria. What is the chance that Child A is a girl and child B is a girl?
It depends how likely the parent named both girls Victoria.