I guess I will not be the first programmer bumping into another variation of the Coupon Collector Problem: please forgive my ignorance!
Here is my understanding of the problem at hand:
Let $S$ be a set of $N$ different kinds of coupons.
Assume that the coupons are shipped into random packs of at most 1 coupon of each type: that is each pack is a sequence $a_1,...,a_N$ such as $0 \leqslant a_i \leqslant 1$.
Assume the frequency $f_i$ of these 2^N packs are known but may differ, $\sum_{i=1}^{2^N}{f_i}=1$
Assume a person wants to collect all the $g$ different kinds of coupons, by randomly picking packs.
I am interested in two quantities:
- What is the probability one person can collect all $N$ coupon kinds by drawing only 2 packs?
- What is the probability that two persons get the exact same final, possibly incomplete, collection by drawing 2 (possibly different) packs?
I have seen proofs of the batched version by inclusion/exclusion, (eg here) that I could possibly use and adapt, but -being a dumb programer- I would be very interested by a pointer to an existing solution to this exact (weighted) problem?