Next: About this document ...
Lecture 15: Transformations of Random Vectors
Considered together, the maximum likelihood estimates of the mean and variance
of a normal distribution constitute a vector valued statistic. We will
now look at trying to compute the distribution functions of such statistics.
We shall confine ourselves to the case where the original random variables have
a distribution given by a density. The computations are greatly aided by
the change of variables theorem for multiple integrals.
Change of variables in multiple integrals
Recall from calculus that if
is an
increasing continuous function and
is differentiable on (a,b) then

while if
is a decreasing
continuous function and
is differentiable on (a,b) then

These two formulae can be combined into one if one adopts the convention that
stands for the interval whose endpoints are
and
. Since
decreasing means
, and
increasing means
, we can just write
![\begin{displaymath}
\int_{\phi([a,b])}f(x)\;dx = \int_{[a,b]} f(\phi(u))\vert\phi'(u)\vert\;du.\end{displaymath}](img11.gif)
For example, consider
![\begin{displaymath}
\int_0^{1/2}\sqrt{1-x^2}\;dx = \int_{[0,1/2]}\sqrt{1-x^2}\;dx\end{displaymath}](img12.gif)
Note that
and
is decreasing.
In calculus you learned to write

Simply observe that on the interval
,
and that our new version of the change of variables theorem eliminates one
step:
![\begin{displaymath}
\int_{[0,1/2]}\sqrt{1-x^2}\;dx
=
\int_{[\pi/3 ,\pi/2]}\sqrt{1-\cos^2(t)}(\sin(t))\;dt.\end{displaymath}](img18.gif)
If we had made a sine substitution, the formulae are equally efficient.
We want to apply this technique to multiple integrals. What makes it work
is that when we divide up [a,b] into subintervals, the function
tells
us how to divide up
into intervals, by mapping endpoints to
endpoints. The appearance of the factor
reminds us that the mean value
theorem can be used to get the length of each new subinterval in terms of the
old one, since it tells us that

for each subinterval
. If we now consider at closed
and bounded set
and a continuous function
which is differentiable and invertible on the interior of C, we see that

where
is the
matrix of derivatives of
, and x and
u represent d-tuples.
Here is a familiar example. Suppose that
,that is, the disk in the plane of radius A. We want to integrate some function
f(x,y) over D. If we put
,and
, then
, and
satisfies the conditions of the change of variables theorem.
![\begin{displaymath}
\phi'(r,\theta)
=
\left[\begin{array}
{cc}
\cos(\theta) & -r\sin(\theta)\ \sin(\theta) & r\cos(\theta)\end{array}\right]\end{displaymath}](img30.gif)
which has determinant
. Hence the change of variables formula
says

Since C is a rectangle, the latter integral is easily rewritten as an
iterated integral:

yielding the familiar polar coordinate formula for evaluating double integrals.
Some applications in statistics
Problem 1:
Suppose that R and S are independent gamma random variables with
equal r and s respectively, and
. Put

Let us find the distribution and density of T(R,S).
Solution:
Let fR and fS denote the densities of R and S, respectively.
We want to compute

Let the vector valued random variable (U,V) be defined by

Note that since R and S take only non-negative values, the range of T is
is the rectangle
. The inverse of T, call
it
is given by

Therefore, for a > 0 and
,

Therefore,

and by applying the change of variables theorem we get

Since C is a rectangle with sides parallel to the coordinate axes, this last
integral is easily written as an iterated integral and evaluated as soon as
we evaluate
. This is easy as
![\begin{displaymath}
\phi'(u,v) =
\left[\begin{array}
{cc}
v & u\ 1-v & -u\end{array}\right]\end{displaymath}](img46.gif)
so
since we know
. Using the explicit
formulae for the gamma densities we get

which simplifies to

for
. Therefore

simultaneously showing
-
; - R + S has a gamma distribution;
-
has a beta distribution;
- R + S and
are independent.
Problem 2: Suppose that R and S are independent random variables,
each with a standard normal distribution. Let t be a fixed real number, and
define the random vector (U,V) as the rotations of the random vector (R,S)
through the angle t. Discuss the properties of the random vector (U,V).
Solution:
Let fR and fS denote the densities of R and S, respectively.
We want to compute

where

Note that since R and S take any real values, the range of T is
is the rectangle the whole plane.
The inverse of T, call
it
is given by

Therefore,

where Q is the quadrant
.Since

the change of variables theorem implies

Since Q is a quadrant with sides parallel to the coordinate axes, this last
integral is easily written as an iterated integral and evaluated as soon as
we evaluate
. This is easy as
![\begin{displaymath}
\phi'(u,v) =
\left[\begin{array}
{cc}
\cos(t) & -\sin(t)\ \sin(t) & \cos(t)\end{array}\right]\end{displaymath}](img62.gif)
so
. Furthermore,

so

This shows that U and V are also independent random variables with the standard
normal distribution.
The preceding are examples of the following corollary to the change of
variables theorem. It gives us a formula for the density of the transformation
of a random vector V by the function g in terms of the density of V and
the inverse of g.
Theorem Suppose that V is a k dimensional vector valued random
variable with density fV and S is an open subset of Rk such that
. Suppose that
is one-to-one and has a
continous invertible first derivative on S. Let h be the inverse of
g. Then the density of the vector random variable g(V) is given by

Here is a very interesting example. Suppose that V is a random sample of
size k from
a normal population with mean 0 and variance
. Let O be any
matrix with the property that OtO = I. Let us find the density
of V0. The density of V is

where
is
and
means dot product. Here
g(V) = OV so h(Y) = YO-1 = YOt, so h'(Y) = Ot. Since Ot = O-1
we see that
and
, so

so we see that the components of VO are again a random sample of size k from
a normal population with mean 0 and variance
.
Next: About this document ...
Eric S Key
11/2/1998