Mathematical Statistics 465 Homework 01, Spring 2005
Due: February 10, 2005 at the start of class
TCGtS stands for The Cartoon Guide to Statistics .
- Collect a data set consisting of the weights (in pounds) and heights (in
inches) of at least 50 people. Put each height/weight pair on a separate index
card.
- Make a stem-and-leaf plot for the weights, and a stem-and-leaf plot for
the weights.
- Find the mean, median, and quartiles for the heights and the weights.
- Find the inter-quartile range and the standard deviation for the heights
and weights.
- Make box-and-whisker plots for the heights and for the weights.
- Make histograms for the heights and the weights.
- Make a scatter plot of your data. Put the heights on the horizontal axis
and the weights on the vertical axis.
- Make a copy of your scatter plot and overlay the least squares regression
line. What is the sum of the squared errors?
- Make a copy of your scatter plot and overlay the median-median line.
What is the sum of the squared errors?
- Make a copy of the scatter plot and overlay the least squares quadratic
function. What is the sum of the squared errors.
- Make a copy of the scatter plot and overlay the least squares cubic
function. What is the sum of the squared errors.
- What differences do you notice in your squared errors among the four
plots?
- What is the correlation coefficient for the height and the weight data you
have collected?
- What do your results tell you about the following statement and its
relationship to your data? Mass is proportional to volume, and for a sphere,
volume is proportional to the cube of the diameter.
- What do you expect the histogram to look like for 100 rolls of a die?
Now,
roll a single die 100 times and record the outcomes. Give summary data and
an histogram. What do you make of this?
- Get dice of two different colors, or at least a pair of dice where you
can tell one from the other. (I will refer to them as red and green.) Roll
this pair of dice 100 times, and record each outcome as an ordered pair,
(Green, Red).
- If we were to make a histogram of the sum of the outcomes of each toss,
what would you expect the histogram to look like? What is the relation to an
addition table for the whole numbers from 1 to 6?
- Construct a histogram of these sums from your data, as well as summary
statistics. How different are they? Propose a statistic that measures this
difference and compute it.
- Construct a scatter plot of your ordered pairs. Compute lines of best fit
both by least squares and by the median-median line method. How good are these
fits? Compare the results with the height weight data. Explain any
similarities and differences.
- What is the correlation coefficient for the results on the green and the
red dice? Interpret this result.