I know you asked for a rigorous mathematical proof but since this is the only popular thread with this question (which I found) I am going to give a solution which helped me think of this. It is a pseudo game theoretic set up for the problem
Set up: There are 2 players - A and B. The payoffs are symmetric and the game is a simultaneous game where at each stage both players are asked if they know the other's player's draw, and each player has the set of actions {yes, no} (as the answer to "do you know which number the other player has?"). Payoffs are just 1 if 1 player gets the correct answer, 0 if both players say "no" and -1 if they get the wrong answer (payoffs aren't necessary to fully understand this)
Okay the key portion: this is where I kind of modify the question. The question I am asking is (and what I believe that the actual question is also effectively asking in the end) is: There is a game where two players draw a card identically, and they get numbers such that the difference between the numbers is 1 (think of this as nature's move). The numbers are drawn from a set of positive integers. What is the optimal NE strategy that they can use two effectively find out what their numbers are?
Before we go ahead, for simplicity a payoff (a, b) is same as (b, a).
Also, we name states of the world as 1 if the state is (1,2), 2 if the state is (2, 3), etc. Basically state = a
Okay so I'm going to outline the strategy here. Generally a player's response will be determined by i) the other players action in $timeperiod_{t-1}$ and the draw of the player. How the other player's action determines the strategy is to see how consistent the other player's strategy is to the optimal strategy if the state had been $state_{n-1}$.
Also also, when going from state to state and checking for consistency, player i puts himself in player j's shoes had the state been $state_{n-1}$. i.e. if we are in state 3, and player A is checking for consistency with state 2, he thinks of himself as player B.
Let's explore the strategy:
If player i's number is 1, answer "Yes" in time period 1. The other player has 2. This is trivial. Given that the other player's number is 1 plus/minus 1, which makes it either {0, 2}, combined with the fact that we draw from positive integers, (0 can't be possible), the other player has to have 2.
Let's go to state 2 (i. e. (2,3)).
- Both players start with prior beliefs of ((1, 3), (2, 4)). Now since neither player has 1, both answer No in the first time period.
- But in time period 2, player A realizes that player B's actions in period 1 weren't consistent with the strategy developed for state 1. Given this, he eliminates the belief that the player B has 1.
- Player A hence now knows the other player has to have 3. In period 2, he says "Yes" [and player B says "No"] and states that player B has 2. Player B says "No".
Let's go to state III (i.e. (3,4)). Players start with prior beliefs of ((2,4),(3,5)).
- Player A and B say "NO" in period 1 since neither drew the number "1".
- Player A and B notice the other player's actions in period 1, to be inconsistent with state 1. Therefore neither player has drawn "1" (both players knew this).
- So what stops player A from saying "YES" in this period [as he would've done we were in state 2]? Notice that in state II, even when player 1 ended the game with a yes, player B, who had his apriori beliefs as the same as player A in this situation didn't call out yes. He infact calls out a "No".
- Time period 3, "belief update" wise (vague idea alert) both players are in state III. Now each player checks for consistency with strategy if we were in state II. Player A immediately notices that period 2's play wasn't consistent with the optimal strategy for state II, hence he narrows his solution set from {2, 4} to just {4}.
- He proceeds to say "yes" is period 3, and says that the other person's number is 4.
We can use this method of reasoning to every single possible state of the world.
In generic notation here is the strategy more explicitly:
- If time period is 1, check if you have drawn the number 1. If yes, end the game and call out the other player's number as 2. If no, say "No"
- If time period != state, say "No". (A longform way to write this would be to explicitly condition the answer on the apriori beleifs and the time period, but this is easier)
- If time period == state, check for consistency of player j's actions with the optimal strategy of $state_{n-1}$. If inconsistent, update beliefs, call out other player.
I once again apologize for the psuedo maths/game theory. I explored this in order to make myself understand the solution (what does it mean that both players know each other is "extremely logical")? This is basically what I came up with. Also apologies for inconsistent notation.