Chapter 2
1. Jason appears to be growing slowly as a toddler. His height between 18 and 30
months of age increases as follows:
Age (mos) Height (cm)
18 76.5
21 78.7
24 82.0
27 84.8
30 86.0
a.) Make a graph of Jason's height against
his age. Do the data show a clear linear
pattern so that you are willing to use
a fitted line as an overall description?
b.) The least-squares regression line
fitted to these data is
y = 61.5 +
0.837t
Plot this line on your graph from
(a).
c.) According to this line, how much does
Jason grow each month? If this line described
Jason's growth from birth, what would
be his height at birth? (In fact, growth is not
linear in the early months of life, so
the line does not describe his birth height.)
d.) Use the fitted line to predict Jason's
height at age 2 years. Then calculate the residual
for this age. Where do the residuals
appear in your graph from (b)?
e.) Would you be willing to use the fitted
line to predict Jason's height at 21 years?
Explain your answer.
2. A rural landowner has a pond with area
50,000 square feet. One day he notices a
growth of algae in one corner of the pond
covering 8 square feet. The algae doubles
its area each day.
a.) How much area does the algae occupy
10 days later?
b.) At this point the landowner becomes
concerned because about 16% of the pond is
covered with algae. How many more
days will the algae take to cover the entire
pond?
3. Identify each bold face variable below as
quantitative or categorical, and also identify
the explanatory and response variable in
each setting.
a.) A political scientist believes that
there is a gender gap in American voting, with
women more likely to vote
Democratic. She therefore interviews a random sample
of voters and records the sex
of the respondents and the political party of the
candidate for whom they voted in the
last presidential election.
b.) A study of the relationship between education
and income records the annual
earned income and the years
of school completed of each of a large sample of
subjects. In addition, the study
records the type of high school attended (public,
religious, or private non-religious).
4. The following table gives data on the lean body mass (kilograms) and resting
metabolic rate for 12 women and 7 men who
are subjects in a study of obesity. The
researchers suspect that lean body mass
(that is, the subject's weight leaving out all
fat) is an important influence on
metabolic rate.
Subject Sex Mass Rate
_________________________________________
1 M 62.0 1792
2 M 62.9 1666
3 F 36.1 995
4 F 54.6 1425
5 F 48.5 1396
6 F 42.0 1418
7 M 47.4 1362
8 F 50.6 1502
9 F 42.0 1256
10 M 48.7 1614
11 F 40.3 1189
12 F 33.1 913
13 M 51.9 1460
14 F 42.4 1124
15 F 34.5 1052
16 F 51.1 1347
17 F 41.2 1204
18 M 51.9 1867
19 M 46.9 1439
a.) Make a scatterplot of the data for
the female subjects. Which is the explanatory variable?
b.) Is the association between these
variables positive or negative? What is the overall shape of the
relationship?
c.) Now add the data for the male
subjects to your graph, using a different color or a different plotting
symbol. Does the type of
relationship that you observed in (b) hold for men also? How do the male
subjects as a group differ from the
female subjects as a group?
5. Many manatees in Florida are killed or
injured by power boats. The table below gives
data on power boat registration (in
thousands) and the number of manatees killed by
boats in Florida in the years 1977-1987.
a.) Make a scatterplot of boat
registrations and manatees killed. The overall relationship is roughly linear.
b.) Calculate the least-squares
regression line and draw it on your graph.
c.) Power boat registrations in 1990
increased to 719,000. Based on the data given here, predict the
number of manatees killed by boats
in 1990.
d.) Which point on the graph has the
largest residual, either positive or negative? Calculate the residual for
that point. Do you think that this
point will be highly influential?
Manatees
Year
Boats Killed
___________________________________
1977 447 13
1978 460 21
1979 481 24
1980 498 16
1981 513 24
1982 512 20
1983 526 15
1984 559 34
1985 585 33
1986 614 33
1987 645 39
When adults are asked their weight, the
weight that they report tends to be less than
their actual weight as measured by a
scale. But there is a strong relationship between
reported weight and measured weight,
because heavy people usually report higher
weights than do light people. Here are
the measured weights x and reported weights
y (in pounds) for 5 female subjects.
______________________
x 112 123 178
141 135
_______________________
y 110 120 165
125 129
________________________
6. Make a scatterplot of these data. Which
observation has the greatest influence on the
position of the regression line and the
value of the correlation coefficient?
7. Compute the correlation coefficient r
between x and y. What percent of the variation
in the weights reported by these subjects
is accounted for by the fact that reported
weight varies linearly with measured
weight?
8. Suppose that all of the subjects reported
a weight 5 pounds less than the values of y in
the table. Would this change the value of
r?
9. Use your value of the correlation r and
the means and standard deviations of x and y to
give the equation of the least-squares regression
line of reported weight on measured
weight. Draw this line on your
scatterplot.
10. Explain
what is wrong with each of the following statements.
a.) "Our study shows that the
correlation between a voter's religion and the political
party he or she prefers is r =
0.45."
b.) "We found that the correlation
between the number of hours per week a student
spends watching television and the
student's grades is r = - 1.13."