Next: About this document ...
Lecture 8: Distribution functions
Not all random variables are discrete, nor, for modeling purposes, would we
want to restrict our attention to discrete random variables. If we return, for
a moment, to the problem of modeling ``draw a real number at random from
[0,1]'' we noted that given any interval
, the event that
the number drawn is in
should have probability equal to the length of
. Hence we want a probability space where
, where the
sigma field contains all subintervals of [0,1], and where the probability
measure of an interval is its length. It is shown in courses on measure theory
that such a probability space exists. (See, for example, Real and Complex
Analysis by Walter Rudin.) The relevant random variable for this this example
is
. Here the range of R is all of [0,1], and as we
explained earlier,

for every y! Hence there is nothing to be gained from looking at the
probability mass function. The solution is look at something different, called
the distribution function of R, denoted by FR, defined by

Every random variable has a distribution function, and all the
information about probablities involving the random variable can be derived
from its distribution function.
For the moment, observe that
![\begin{displaymath}
\Pr(R \in (a,b]) = \Pr(R \leq b) - \Pr(R \leq a) = F_R(b) - F_R(a).\end{displaymath}](img7.gif)
Also notice that any distribution function is non-decreasing and has range
contained in
.
We will have to have another look at properties of probability measures to
show how to use the distribution function to compute
for
any interval.
Continuity properties of probability measures and distribution functions
Consider a sequence of events,
. We have

Define the limit of these sets, denoted by
by

One justification for considering this to be a limit is that the sets An
expand out to give us all of the union of the An. The following is an
important consequence of the additivity axiom for probability measures:
If
, then

This is called the continuity property of probability measures, and is in fact,
equivalent to the additivity axiom. The proof is relatively straightforward.
Put B1=A1 and Bn = An-An-1 for n > 1. The events Bn are disjoint
and for each positive integer n

From this we see that

Therefore

as we claimed.
It is also the case that if we have events
and we define

then

This is easily derived from the previous result by using Boole's identities.
These results are used to investigate the continuity of distribution functions.
Let FR be the distribution function of the random variable R. Since
distribution functions are non-decreasing they have left and right hand limits
at all points. To see if FR is continuous at t it is suffient to check to
see if

converge to FR(t) when n takes only integer values. Fix a value t and let
![\begin{displaymath}
C_n = R \in (-\infty,t+n^{-1}]\end{displaymath}](img22.gif)
Observe that these sets are a nested decreasing sequence of events, and

Therefore

so a distribution function is always continuous from the right.
Now put
![\begin{displaymath}
A_n = R \in (-\infty,t-n^{-1}]\end{displaymath}](img25.gif)
Observe that these sets are a nested increasing sequence of events, and

Therefore

and since

we have

so that FR is continuous from the right at t if and only if
the probability that R equals t is zero. Therefore, FR is continuous at
t if and only if the probability that R equals t is zero, and the jump in FR
at t is equal to the probability that R equals t. The values of t for which
FR jumps are called the atoms of the distribution of R. The limit from
below at t of FR is denoted FR(t-).
With this notation we have
. Therefore,

so we have represented the probability that R is in any type of interval in
terms of the distribution function of R.
In fact, a distribution function F is characterized by the following properties
-
and
; - F is non-decreasing;
- F is continuous from the right with lefthand limits.
It can be shown that for any function F with these properties there is a
probability model
and and a random variable R so that
F is the distribution function of R.
Independence of random variables in general
We can show that any pair of random variables R and S is independent if for any
pair of real numbers x and y,

The function of two variables, FR,S defined by

is called the joint distribution function of R and S. More generally, if
is a vector-valued random variable, its joint distribution
function, FV, is the function on
defined by

It can be shown that the components of V are independent if and only if

Next: About this document ...
Eric S Key
10/1/1998