I read an article that describes how to distinguish between real and fake sequences of coin tosses, with good reliability: we should check the longest run of heads (in real sequences of length $n$, the longest run of heads is about $\log_2 n-1$, so if a sequence's longest run of heads is much different than that, there is reason to suspect that the sequence is fake). If that's not enough, we could use more sophisticated checks, such as counting the successive ordered pairs of results.
That made me wonder about how to distinguish between real and fake prime gap sequences.
Suppose an alien from another planet gives you a sequence, claiming that it is the $1000$ consecutive prime gaps starting from the $10^{100}$th prime gap. How could we check if the alien is likely to be lying?
(With current human technology, we are unable to find the $10^{100}$th prime number.)
I know of the following checks:
- The terms must be even.
- Some subsequences are impossible, for example $8$ followed by $14$.
- The arithmetic mean of the terms should be approximately $\log (10^{100})\approx230$.
- The geometric mean of the terms should be approximately $e^{-\gamma}\log (10^{100})\approx 129$ ($\gamma$ is the Euler-Mascheroni constant)
- $30$ should occur frequently.
What are some other checks?
Remarks:
Of course, there is no way to be sure that any given sequence of coin tosses is fake, and there is usually no way to be sure that the alien is lying (so I have included the "soft-question" tag). But in both cases, we can make good guesses based on known mathematics.
I picked the numbers $1000$ and $10^{100}$, but there is nothing particularly special about them; feel free to adjust them, if that would allow you to provide a more insightful answer.
One might assume that if the alien were lying, it could still anticipate how we would check, and thus avoid detection; but let's not assume that the alien is that sophisticated.