7

In either case, one coin flip resulted in a head and the other resulted in a tail. Why is {H,T} a different outcome than {T,H}? Is this simply how we've defined an "outcome" in probability?

My main problem with {H,T} being a different outcome than {T,H} is that we apply binomial coefficients (i.e. we count subsets of sets) in some common probability problems. But if we take {H,T} and {T,H} to be different outcomes, then our "sets" are ordered, but sets are by definition unordered...

I feel as though the fact that I'm confused about something so basic means that I am missing something fundamental. Any help or insight whatsoever is greatly appreciated!

James Ronald
  • 2,331
  • 1
    It's a question of what you are looking at. If you are looking at the ordered sequence of tosses, then of course $HT$ differs from $TH$. If you are looking at the unordered cumulative results, then the two are the same. Context should tell you which is intended. – lulu Aug 12 '19 at 19:37
  • 2
    As a side note: one reason people sometimes prefer the ordered sequence in this situation is that all outcomes are equally probable. This is false if you look at unordered sequences...${H,T}$ is twice as likely as ${T,T}$ for example. – lulu Aug 12 '19 at 19:38

1 Answers1

12

An experiment has "outcomes." These are the observable results of having performed our experiment. For example, when we reach our hand into a bucket and pull out a ball this outcome might be the color of the ball, or the number on the ball, etc...

A sample space of an experiment is the collection of all possible outcomes of that experiment.

An event is a subset of the sample space.

Depending on the experiment, we may have many different possible sample spaces that we can choose from. Some choices will be more beneficial than others. For example, if our experiment was the "result" of a game of monopoly, we could have each outcome describe every moment in time throughout the game... who purchased a property here, who had to pay taxes there, etc... Alternatively, our outcomes could have been much more brief and just taken into account only the winner of the game.

So long as the events you are interested in describing are in fact subsets of your sample space, that is one of the deciding factors of if a sample space is "valid" to use.


That all being said, one of the reasons why you might choose a particular sample space over another is that you want to use counting techniques to calculate probabilities. We have the nice property that if $A$ is an event and $S$ is our finite sample space that in the event that $S$ is "equiprobable", in other words that every outcome is equally likely to occur, that we have

$$Pr(A)=\dfrac{|A|}{|S|}$$

This is a very powerful tool for calculating probabilities and for this reason we often prefer to use sample spaces where each outcome happens with equal chance.

For this reason, when talking about the experiment of flipping two coins, we often prefer to use the interpretation that the coins are flipped in sequence and order mattering, giving us the sample space of $\{HH,~HT,~TH,~TT\}$

Of course, we may choose to use the sample space where we did not keep track of the order in which these events occurred but just kept track of the total number of times we saw heads versus tails. Just keep in mind that if we do this we will sacrifice the ability to use counting techniques to calculate probability. When outcomes in our sample space are not equally likely to occur, we may not calculate the probability using the above formula. As an example, when playing the lottery you either win or you lose, but certainly winning the lottery doesn't occur half of the time.


As a final aside, it is worth mentioning that it is because we like to work with equiprobable sample spaces so much that we will many times reword problems or reimagine problems in such a way as to let us work with an equiprobable sample space.

For example, for the example where we have a bucket with $10$ indistinguishable white balls and $2$ indistinguishable red balls and we reach in and pick one ball at random and ask what the probability is that the ball was red, we in truth have only the two possible outcomes. The ball was white, or the ball was red, and there is no additional information that we can get by looking at the ball.

We can imagine what would happen if each ball were numbered as well with a unique number, suddenly making the balls all distinguishable. This would allow us to change the sample space into something much more favorable, letting the outcome be the number on the ball as well as the color rather than just the color, making each outcome suddenly become equally likely to occur. Now we can see that the probability is $\dfrac{2}{12}$ for the ball to be red.

JMoravitz
  • 79,518
  • 3
    Thank you very much! This helps immensely and has alleviated more than a week's frustration. I cannot thank you enough. The key realization I didn't make is that the sample space is a choice, and not something set in stone. Wanting to choose your sample space to be one that you can apply combinatorics techniques to makes a lot of sense. Again, thank you so much! – James Ronald Aug 12 '19 at 21:50
  • 2
    @James indeed, it is a choice. And there are sometimes multiple equally good choices. Keep this in mind especially when talking about choosing multiple objects. You could consider order of selection to be relevant or you could choose it irrelevant in many of those examples. So long as you are consistent during your calculations, both should arrive at the same answer even if that answer looks different. Just don't make the mistake of treating order as relevant in numerator but not the denominator or vice versa. – JMoravitz Aug 13 '19 at 00:42