Let X be the smallest value obtained when k numbers are randomly chosen from the set 1,...,n. Find E[X] by interpreting X as a negative hypergeometric random variable.
This is Self Test Exercise 7.7 of Sheldon's A First Course in Probability.
The way I approached this is to first consider (for example) P(X = 1). Using the suggestion to take X as a negative hypergeometric random variable, we have
$$P(X = 1) = \frac{\binom{1}{1}\binom{n - 1}{k - 1}}{\binom{n}{k}}$$
My idea was that we have one element 1 and the rest $k - 1$ elements from ... the remaining $n - 1$ elements. Similarly, for $P(X = 2)$, we have one way of getting the minimum element $2$, and $\binom{n - 2}{k - 1}$ ways of getting the other elements (well, except 1).
Similarly, we can deduce that
$$P(X = i) = \frac{\binom{n - i}{k - 1}}{\binom{n}{k}}$$
As a result, I got
$$E(X) = \sum_{i = 1}^n i \times \frac{\binom{n - i}{k - 1}}{\binom{n}{k}}$$
The answer in the book is $\frac{n + 1}{k + 1}$, which is just so much more elegant than what I came up with, and I'm not seeing how my answer reduces to theirs.
My questions are:
- Is my approach correct? If not, what mistake(s) did I make in my analysis?
- If my answer is correct, how do I get to the expected result?