This question was asked by a colleague, and I need help on how to handle it. Here's the setup. For context, this is about artificial intelligence models applied to sensor data.
- Let's start with $N$ sensors. From these sensors, $n$ different sensors are selected randomly.
- Some processing is done with data from selected sensors.
- A new selection of $n$ sensors is done randomly (previous ones can be selected again), and processing is done with this new set.
Now, let's say that this selection process is done $m$ times. What is the probability that every pair of sensors was selected at least once during this process?
That last part is the tricky one. For example, with 5 sensors and 3 sensors per selection, what is the probability that sensor pairs 1-2, 1-3, 1-4, ..., 3-5 and 4-5 were selected at least once after $m$ drawings?
I had a look at hypergeometric distribution, but I'm not sure if this is what I need.
I hope this is clear enough. Regards
Francois