A postman brought N letters to a house with two letter-boxes. Since the two boxes were empty, he puts 1 mail in each of the two mail boxes. Then he chooses one of boxes with probability proportional to number of letters present in that box, and puts the 3rd letter in it. He does this for all subsequent letters. What is the expected number of letters in the box with lower letters?
This problem is from Crazy Postman.
My approach: I used recurrence for solving this problem. Suppose $E_n$ is the expected number of letters in the box with lower letters. Now, $E_{n-1}$ is expected number of letters in box with lower letters if number of letters is n-1, so $n-1-E_{n-1}$ is the expected number of letters in box with higher letters. Now,
$E_n = \frac{n-1-E_{n-1}}{n-1} (E_{n-1}) + \frac{E_{n-1}}{n-1}(E_{n-1}+1)$
Is this correct?
Explanation of the equation: With probability of number of letters in higher letters box, we can say number of letters in lower letters box remains constant and with 1- that probability, number of letters in lower letters box will increase by 1.
My question is can we take the probability by the expected number of letters or do we need the exact number?
Additionally, can someone explain the solution given in the site? I have quoted it here.
Suppose I have a stack of 2 black cards and one red card. Initially I put red between 2 black cards. Now I add black cards randomly between any two cards (so, initially it is either above or below red). Note that the probability that I add the card above the red card, when x-1 is the number of cards above red and y-1 is the number of cards below red is x/(x+y). Let the problem be: if red card is dividing the black cards into two sets, what is the expected number of black cards in the smaller section. So, we see that the two problems are equivalent. Now this way, we are getting all possible combinations in which one red and n black cards can be mixed, we see that the probability that the red card is at height h is independent of h. So, the probability that the smallest box contains n/2 letter or 1 letter (or any number of letters between 1 and n/2) are all same. So, expected number of letters in the smaller box is asymptotically n/4.
I am unable to understand the logic/flow of the solution given. Kindly help.