PiedPiper
21st January 2011, 01:52 PM
I apologize if this thread is off topic for this forum; I'm still quite new here (see post count) and I read the Membership Agreement carefully before posting. Feel free to nuke this thread from orbit if necessary :).
I need a little help (for my own sanity) puzzling out a statistical anomaly. It's been bugging me for years, ever since I took undergraduate statistics, and I haven't found an answer yet. I even wrote a computer program (in Fortran, no less - how retro of me) to simulate 10,000,000 iterations of the following "game" I'm about to describe, and the simulation came out with the exact same results as the math. What I don't understand is the "why", as it flies in the face of logic, and while I accept that some statistics do that, it would be fantastic to have some input and maybe even some "closure" on this issue for me.
Stated simply, the "game" goes like this:
You take ten marbles, and put them into a bag. The marbles are either red or blue, a random number of at least one each, but adding up to ten marbles total. The bag is shaken up, and someone takes out a single marble at random. If the marble is red, the game stops. If the marble is blue, it's put aside (not back in the bag), and another marble is taken out. If it's red, the game stops. If it's blue, it's put aside and another marble is taken out. Etc, etc. The game is over when a red marble has been drawn.
At the end of the game, you count up the number of blue marbles out of the bag, and that is your score for the game. So let's look at some examples of gameplay. For the sake of argument, we're going to say that there are 9 blue marbles in the bag, and one red marble.
Game #1: Boom, red marble right away. Game stops. No blue marbles are out of the bag, so the score is 0.
Game #2:
Draw 1: Blue marble, game continues.
Draw 2: Blue marble, game continues.
Draw 3: Red marble. Game stops. Two blue marbles are out of the bag, so the score is 2.
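For anyone who wants to reproduce the simulation, here is a minimal Python sketch of the rules above. It is not the original Fortran program; the names play_game and simulate, the default of 100,000 games, and the fixed seed are all just assumptions for illustration.

```python
import random

def play_game(blue=9, red=1, rng=random):
    """Play one game; return the number of blue marbles drawn before the first red."""
    # Assumes at least one red marble, as the rules require.
    bag = ["blue"] * blue + ["red"] * red
    score = 0
    while True:
        # Take out one marble at random; it does not go back in the bag.
        marble = bag.pop(rng.randrange(len(bag)))
        if marble == "red":
            return score
        score += 1

def simulate(games=100_000, blue=9, red=1, seed=0):
    """Return the observed frequency of each score, indexed by score (0..blue)."""
    rng = random.Random(seed)
    counts = [0] * (blue + 1)
    for _ in range(games):
        counts[play_game(blue, red, rng)] += 1
    return [c / games for c in counts]

if __name__ == "__main__":
    for score, freq in enumerate(simulate()):
        print(f"score {score}: {freq:.4f}")
```

With 9 blue and 1 red, each printed frequency should land close to 0.10, give or take sampling noise.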
So let's take a look at the odds here, the statistics of this game. There are 9 blue marbles and 1 red marble.
Odds of scoring 0 in the game (hitting the 1 red marble right away) are 1 in 10, 10%.
Odds of scoring 1 in the game (drawing one blue marble first, and then hitting the red on the second draw): (9/10) - which is the number of blue marbles divided by the number of marbles total - multiplied by (1/9) - the odds of hitting the (one) red marble in the remaining pool of 9. (9/10)*(1/9) = (1/10) = 1 in 10 = 10%. Strange, no? The odds of straight out hitting that red marble first draw are 10%, 1 in 10. To achieve a score of 1, you had to avoid hitting the red marble that first time, leaving it in the bag, and draw a blue first, then the red.
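The same multiplication can be done exactly for any score. The sketch below uses Python's fractions module so nothing gets rounded; prob_score is a made-up helper name, and the blue and red parameters are assumptions added so other mixes of marbles can be tried.

```python
from fractions import Fraction

def prob_score(k, blue=9, red=1):
    """Exact probability of drawing exactly k blue marbles and then a red one."""
    if k < 0 or k > blue:
        return Fraction(0)
    total = blue + red
    p = Fraction(1)
    # k blue draws in a row, each from a bag that has shrunk by one blue marble.
    for i in range(k):
        p *= Fraction(blue - i, total - i)
    # Then a red marble from the total - k marbles still in the bag.
    p *= Fraction(red, total - k)
    return p

if __name__ == "__main__":
    for k in range(10):
        print(k, prob_score(k))
```

For 9 blue and 1 red, every score from 0 to 9 comes out as 1/10. Calling prob_score(k, blue=8, red=2) gives the 2 red, 8 blue case instead.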
Taking this trend to the extreme, scoring 9, how would that work?
Draws:
Blue, blue, blue, blue, blue, blue, blue, blue, blue, red.
Math (probabilities):
(9/10)*(8/9)*(7/8)*(6/7)*(5/6)*(4/5)*(3/4)*(2/3)*(1/2)*(1/1, the red marble, only one left) - all of which cancels out to... 1 in 10.
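Written out for a general score k (with 9 blue marbles and 1 red), the cancellation looks like this. The symbol k and the product notation are my own shorthand, not anything from a textbook:

```latex
\[
P(\text{score} = k)
  = \left( \prod_{i=0}^{k-1} \frac{9-i}{10-i} \right) \cdot \frac{1}{10-k}
  = \frac{10-k}{10} \cdot \frac{1}{10-k}
  = \frac{1}{10},
  \qquad k = 0, 1, \dots, 9 .
\]
```

Each numerator in the product cancels the next denominator, leaving (10-k)/10 for the run of blues, and the final red draw supplies the matching 1/(10-k).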
So I could go on at length about the different iterations of this game, and how the odds "make sense to me" if there are, for example, 2 red marbles and 8 blue, instead of 1 and 9, but what it boils down to is this:
According to the math (and the computer simulation I did), it's equally likely that:
1. Someone will hit the red marble with their first draw.
AND
2. Someone will avoid the red marble, drawing a blue; again, they'll avoid the red marble, drawing a blue; they'll do this again; and again; and again; and again; until they've "avoided" drawing that single red marble 9 times and it's finally the only thing left in the bag for them to draw out.
How can these two scenarios be equally likely?
Help :(