next up previous
Next: About this document ...

Lecture 1: Some Simple Examples of Statistical Methods

I have in my possession two pennies which appear, to the naked eye, identical. However, one of them comes up heads 1/2 the time while the other comes up heads only 40 percent of the time. Unfortunately, I have confused them, and then lost one of them. I would like to design an experiment to determine which one I have left.

I think that that the one that is left is the trick one. If this is the case, I expect that in large number of tosses, roughly 40 percent of the tosses will be heads. My experiment will be of the following sort. I will toss the coin 100 times. If I get around 40 heads, I will conclude that my coin is the trick one. If not, I will conclude that it is not the trick one. My problem is to make precise the idea of around 40 heads. I know that with the trick coin I might get 35 heads or 45 heads in 100 tosses. On the other hand, I would be astonished if I got 10 heads, or 90 heads.

I decide then that my test will be if I get anywhere from 35 to 45 heads I will be satisfied that it is the trick coin, and otherwise I will conclude that I have lost the trick coin and all that I have is an ordinary fair coin.

There are then four possibilities before I run my test:

1.
I have the trick coin and as a result of my experiment I conclude that I have the trick coin.
2.
I have the trick coin and as a result of my experiment I conclude that I have an ordinary coin.
3.
I have an ordinary coin and as a result of my experiment I conclude that I have the trick coin.
4.
I have an ordinary coin and as a result of my experiment I conclude that I have an ordinary coin.
One measure of how effective my test is to that there be very little chance that (2) and (3) occur. Fortunately, the probabilities of these events can be calculated as follows:

Suppose that, for example, I wanted to know the probability of getting 35 heads in 100 tosses of a coin. If I think of a sequence of 100 H's and T's in which 35 H's occur, each such sequence accounts for one way the experiment of tossing the coin 100 times could occur. Any of these individual sequences is equally likely, so if I could count how many sequences there were, and then multiply by the chance one of them occurs, I would know the chance of getting 35 heads.

Each of these steps is easily computed. In the first place, there are

\begin{displaymath}
\frac{100!}{35!65!}\end{displaymath}

sequences containing exactly 35 H's and 65 T's. Secondly, if p is the chance of heads expressed as a decimal, the chance of each sequence is

p35(1-p)65.

Thus the chance of exactly 35 H's and 65 T's is

\begin{displaymath}
\frac{100!}{35!65!}p^{35}(1-p)^{65}.\end{displaymath}

If we repeat this analysis for a different number of heads, say x, we see that the chance of x heads and 100-x tails when p is the chance of a single head is given by

\begin{displaymath}
\frac{100!}{x!100-x!}p^{x}(1-p)^{100-x}.\end{displaymath}

Thus the chance of (1) is

\begin{displaymath}
\sum_{x=35}^{45} (0.4)^x(0.6)^{100-x} \approx 0.739,\end{displaymath}

the chance of (2) is

\begin{displaymath}
\left(\sum_{x=0}^{34} (0.4)^x(0.6)^{100-x}\right) + \left(\sum_{x=46}^{100}
(0.4)^x(0.6)^{100-x}\right) 
 \approx 0.261, \end{displaymath}

the chance of (3) is

\begin{displaymath}
\sum_{x=35}^{45} (0.5)^x(0.5)^{100-x} \approx 0.183,\end{displaymath}

and the chance of (4) is

\begin{displaymath}
\left(\sum_{x=0}^{34} (0.5)^x(0.5)^{100-x}\right) + \left(\sum_{x=46}^{100}
(0.5)^x(0.5)^{100-x}\right) 
 \approx 0.817.\end{displaymath}

Since the chances of making the wrong decision seem rather high, we could modify our test in two obvious ways: toss the coin more, or change the region for making our decision.

If we double everything, so that we have 200 tosses and 70 to 90 heads we get that the chance of (1) is about 0.871, the chance of (2) is about 0.129, the chance of (3) is about 0.089 and of (4) about 0.911. Remember, we want (2) and (3) to be small. This is certainly better.

On the other hand, if we simply make the interval wider while using 100 tosses, say by using 33 to 47 heads as our criteria, the chance of (1) is nearly 0.875, and the chance of (2) nearly 0.125, but the chance of (3) becomes nearly 0.310, and the chance of (4) drops to about 0.690.

Thus somehow we must strike an acceptable balance between a lengthy experiment and acceptable values for the probability of the test being in error.

In order to begin to make order out of this chaos, we have to have some description of what is happening in general. There are three basic things common to all analyses.

1.
The set of possible outcomes of an experiment. This is called the sample space. In our original case it consisted of all possible sequence s of H's and T's of length 100. In our modified test, it was all possible sequences of H's and T's of length 200.
2.
A set of subsets of the sample space. Not any set of subsets will do. For example, in our original test, we needed the set of all elements with exactly 35 H's and 65 T's, and we needed the complement of this set. We will see later exactly what rules this set of subsets needs to obey. A set of subsets of the sample space which obeys these rules will be called a sigma algebra or a sigma field.

3.
A way to assign probabilities to the elements of the sigma algebra. We had for each of our sigma algebras two related ways of doing this, essentially by counting. There are certain rules that such assignments must obey, and assignments which obey these rules are called probability measures.


 
next up previous
Next: About this document ...
Eric S Key
8/31/1998