If I'm sampling from N items, how many samples are needed to get all N items at least once?

Question

I have a situation where I have N items in the population (e.g. these are item IDs in some inventory). Suppose I can sample and get back 1 item at a time, where each item has an independent and uniform distribution of being sampled; for example, I would call some function sample() and it would return one of the N items with probability $\frac{1}{N}$.

On average, how many samples would I need to observe all N items at least once? In other words, how many samples would I need to get the entire population? Obviously, I would need at least N samples as a lower bound, but is there a tighter bound?

Thank you for any help. I'm a software engineer, so please be gentle with any math.

This is known as the Coupon Collector Problem ...easy to find information about it online (that link is a good start). — lulu, Oct 07 '16 at 17:57
@RossMillikan: The Wikipedia page says that the answer is O(N log N), but the linked question "Using Recursion to Solve Coupon Collector" does not mention that form. Why is that? — stackoverflowuser2010, Oct 07 '16 at 18:21
@stackoverflowuser2010: $\displaystyle \sum_{i=1}^{10} \dfrac{10}{i} \approx 10\left( \log_e(10) + \gamma + \dfrac{1}{2\times 10}-\dfrac{1}{12 \times 100^2}+\cdots\right)$ where $\gamma \approx 0.5772156649$. Search for "Harmonic number" for more information — Henry, Oct 07 '16 at 18:52

score 1 · Answer 1 · answered Oct 07 '16 at 17:59

1

There is no such number. Even if you only have two items, it is possible that you can sample the pair randomly one million times and still select the same item every single time.

It's highly unlikely, but not impossible.

You probably want to ask how many samples are needed to guarantee seeing every item with a probability of (say) $99.99\%$.

Am I misunderstanding?

answered Oct 07 '16 at 17:59

MPW

43,638

Quote: "On average". – Did Oct 07 '16 at 19:38

If I'm sampling from N items, how many samples are needed to get all N items at least once?

1 Answers1