The main reason that would account for the high correlation is that countries with a large number of TV's per person will be countries with high standards of living, so have healthier people. Therefore, they live longer.
However, any explanation that is reasonable will be accepted.
NOTE: This part has an error that is revealed if someone tries
to use both approaches to get
: the results do not match. This is
because the SSE value give in the question is wrong. It should be
about 678, not 33.9. So, give full marks to either approach,
regardless of the answer.
First,
From (a), we know that
. We are also told that
SSE = 33.9 and
. Therefore
P-value =
with
df.
Since the T-distribution is symmetric,
.
From the T-table, we find that
.
Since
,
.
Therefore p-value
.
If you used Minitab, you would have found
(approximately).
Therefore we have very strong evidence against
.
Therefore there is very strong evidence to suggest there is a positive
linear relationship between
and
.
When
,
.
We want a 95% CI, so
, so
using the T-table with
df.
The 95% CI is
From the plot, it appears that, as the rank on the wedding day gets higher (which, for tennis players, means they are getting worse), the anniversary ranking gets higher. However, it doesn't seem to be a very strong relationship. It also seems that an assumption of a linear relationship is questionable.
The regression equation is Anniv = 15.9 + 0.927 Wedding Predictor Coef SE Coef T P Constant 15.88 12.85 1.24 0.231 Wedding 0.9273 0.3782 2.45 0.024 S = 41.60 R-Sq = 23.1% R-Sq(adj) = 19.3%From the output, we see the least squares line is
From the Minitab output
Since p-value =
(using T-distribution with
df) , reject
.
Or,
, so reject
.
Therefore there does appear to be linear relationship between wedding and anniversary ranking.
In other words, a ranking of 50 on the wedding day would still be a 50 on the
first anniversary. This means
and
, so the equation
would be
.
Predicted Values for New Observations New Obs Fit SE Fit 99.0% CI 99.0% PI 1 35.35 8.97 ( 9.82, 60.88) ( -85.73, 156.44) Values of Predictors for New Observations New Obs Wedding 1 21.0we see that the 99% PI for
You could also calculate this interval by hand, where
from the output, and the other terms would
have to be calculated by hand.
Note that the lower value of the PI isn't realistic, since it says the player's ranking could be -85.
Find the least squares equation that predicts sales from the explanatory variables.
A portion of the Minitab output is below:
Regression Analysis: Drywall versus Permits, Mortgage, ... The regression equation is Drywall = - 138 + 4.97 Permits + 20.7 Mortgage - 10.9 A Vacanc + 0.50 O Vacanc Predictor Coef SE Coef T P Constant -137.9 163.0 -0.85 0.412 Permits 4.9657 0.4869 10.20 0.000 Mortgage 20.70 19.77 1.05 0.313 A Vacanc -10.946 8.019 -1.36 0.194 O Vacanc 0.504 3.336 0.15 0.882 S = 43.68 R-Sq = 89.6% R-Sq(adj) = 86.6%
From the output, the least squares equation is
. This tells us that, if mortgage rates
increase by 1 percentage point, and the other explanatory variables
remain fixed, we predict that drywall sales will increase by 2070
sheets (since sales are reported in 100's of sheets).
Question to consider: Do you think this is unusual? We may have assumed that increasing mortgage rates would mean less building, so less demand for drywall.
From the output,
.
By the definition, 89.6% of the variability in drywall sales can be accounted for (or explained) by the regression line relating pay to number of permits, mortgage rates and vacancy rates.
In a general sense, this value of
indicates we have a
reasonable model for predicting sales.
The needed Minitab output is below:
Analysis of Variance Source DF SS MS F P Regression 4 229881 57470 30.11 0.000 Residual Error 14 26717 1908 Total 18 256599
From the output
Given
: Since p-value
, reject
.
Or,
, so reject
.
So model appears useful-at least one of the explanatory variables helps to predict drywall sales.