Next: About this document ...
Lecture 16: Student's t Statistic
We want to consider a very important special case of an orthogonal
transformation of a random sample, V, of size k from a mean zero normal
population.
Suppose that the first column of my orthogonal matrix O has all entries equal
to
. Put
. Then R is also a random
sample of size k from the same normal distribution,

and

while

Observe that since R1 and the vector
are independent,
we have shown that for a random sample from a normal distribution with mean 0
the maximum likelihood estimate of the mean and the maximum likelihood estimate
of the variance are independent. In fact, if the random sample were from a
normal population with arbitrary mean
, since
is normally
distributed with mean 0, and the maximum likelihood estimate of the variance
is unchanged when the a constant is subtracted from the observations, we have
shown that the maximum likelihood estimates of the mean and the variance are
independent whenever the population is normally distributed.
What is more, the maximum likelihood estimate of the mean is a random variable
with a normal distribution, and the maximum likelihood estimate of the variance
has a gamma distribution. This makes it possible to compute the density
of the ratio of these two maximum likelihood ratios. Before we do so, let
us see why we might want to bother.
Suppose that we had a population that was known to have a normal distribution,
but we knew neither the mean nor the variance of this distribution. In fact,
our problem was to try to determine if the mean of this normal population was 0
or not. To do this, we initially might try to use collect a random sample of
size k, compute the maximum likelihood estimate of the mean,

and decide that if
is near to 0 then the population mean is 0
and if
is not near to 0 then the population mean is not 0. We
would quantify this by picking a set A containing 0, and saying
- Accept the assertion that the mean is 0 if
. - Reject the assertion that the mean is 0 if
.
The natural questions to ask are then
- If the mean really is 0, how likely is it that our test will tell us to
reject the assertion that the mean is 0? In other words, what is

if the mean is 0?
- If the mean really is not 0, how likely is it that our test will tell us
to accept the assertion that the mean is 0? In other words, what is

if the mean is 0?
The problem is that we cannot answer even the first question, for although we
know that
has a normal distribution with mean 0 we do not know
its variance. If the variance were known we could divide by its square root,
the standard deviation, and get a random variable with a standard normal
distribution. Of course, then A would depend on this standard deviation. What
to do?
The obvious thing to try would be to try to divide by an estimate of the
standard deviation. One convenient estimate would be the maximum likelihood
estimate of the standard deviation,

especially since we know
that it is independent of
! At first glance this seems like it
won't quite work as the distribution of
also depends on the
standard deviation of the sample,
. However, all is not lost, since

Now notice that if the population mean is really 0, then
is the square root of the sum of the squares
of k-1 independent, normally distributed random variables, each with mean 0
and variance 1, and
is a random variable which is
normally distributed with mean and variance 1. Thus if V is a random
sample from a normal distribution with mean 0, the distribution of

is the same as the distribution of the ratio of two random variables R and
S, R/S,
where
- R and S are independent;
- R has the standard normal distribution,
- S2 has a gamma distribution with
and
,that is, S2 has a chi-square distribution with k-1 degrees of freedom.
It is straightforward to find the density of R/S, so we shall do so below.
Right now, it is important to realize that we can formulate our test about
whether or not the population mean of our normal population is 0 in terms of
, and the all probabilities involving t(V) can be computed
so long as the sample size is known. t(V) is called Student's t
statistic and its density (really a family of densities depending on the
sample size k) is called the t density.
A formula for the t density
Let R and S be as above. Then since R and S are independent and R has a
density which is symmetric about 0, R/S also will have a density which is
symmetric about 0. Thus we will first try to find
for any positive number a. From here it is easy to
get the density of R/S. Now

and we know that since R2 and S2 are independent gamma random variables
with
and
parameters equal to 1/2 and (k-1)/2
respectively, that S2/(R2 + S2) has a B((k-1)/2,1/2) distribution.
Therefore,

and so

If we differentiate this expression we obtain the density of R/S for a > 0:

However, since the density of R/S is symmetric about 0, the preceding
formula gives the density of R/S, and, therefore, T(V), for all a. It then
follows immediately from the definition of t(V) that the density of t(V) is
given by

This density is called the t density with k-1 degrees of freedom. Notice
that as
that ft(V)(x) converges to the standard
normal density. This is one justification for multiplying T(V) by the factor
.
More generally, if we have a random sample, V, of size
from a normal
population with
mean
and variance
, then

has a t distribution with k-1 degrees of freedom.
Next: About this document ...
Eric S Key
11/9/1998