Next: Application to the binomial
Up: Probability theory
Previous: The binomial distribution
What is meant by the mean or average of a quantity? Well, suppose that we
wanted to calculate the average age of undergraduates at the University of Texas at Austin.
We could go to the central administration building and find
out how many eighteen year-olds, nineteen year-olds, etc. were currently
enrolled. We would then write something like
|
(24) |
where is the number of enrolled eighteen year-olds, etc.
Suppose that we were to pick a student at random and then ask ``What is
the probability of this student being eighteen?'' From what we have
already discussed, this probability is defined
|
(25) |
where
is the total number of enrolled
students.
We can now see that the average age takes
the form
|
(26) |
Well, there is nothing special about the age distribution of students
at UT Austin. So, for a general variable , which can take on any one of
possible values ,
, with corresponding probabilities
,
,
the mean or average value of , which
is denoted , is defined as
|
(27) |
Suppose that is some function of . Then, for each of
the possible values of , there is a corresponding value
of which occurs with the same probability. Thus, corresponds
to and occurs with the probability , and so on. It follows from
our previous definition that the mean value of is
given by
|
(28) |
Suppose that and are two general functions of . It follows that
|
(29) |
so
|
(30) |
Finally, if is a general constant then it is clear that
|
(31) |
We now know how to define the mean value of the general variable .
But, how can we characterize the scatter around the mean value?
We could investigate the deviation of from its mean value ,
which is denoted
|
(32) |
In fact, this is not a particularly interesting quantity, since its average
is obviously zero:
|
(33) |
This is another way of saying that the average deviation from the
mean vanishes. A more interesting quantity is the square of the
deviation. The average value of this quantity,
|
(34) |
is usually called the
variance.
The variance is clearly a positive number,
unless there is no scatter at all in the
distribution, so that all possible values of correspond to the
mean value , in which case it is zero.
The following general relation
is often useful
|
(35) |
giving
|
(36) |
The variance of
is proportional to the square of the scatter
of around its mean value. A more useful measure of the scatter
is given by the square root of the variance,
|
(37) |
which is usually called the standard deviation of . The
standard deviation is essentially the width of the range over which
is distributed around its mean value .
Next: Application to the binomial
Up: Probability theory
Previous: The binomial distribution
Richard Fitzpatrick
2006-02-02