Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Regression and Predictions. Exercises 13–28 use the same data sets as Exercises 13–28 in Section 10-1. In each case, find the regression equation, letting the first variable be the predictor (x) variable. Find the indicated predicted value by following the prediction procedure summarized in Figure 10-5 on page 493.

Using the president/opponent heights, find the best predicted height of an opponent of a president who is 190 cm tall. Does it appear that heights of opponents can be predicted from the heights of the presidents?

President

178

182

188

175

179

183

192

182

177

185

188

188

183

Opponent

180

180

182

173

178

182

180

180

183

177

173

188

185

Short Answer

Expert verified

The regression equation is\(\hat y = 161.9 + 0.097x\).

Thebest predicted height of an opponent of a president who is 190 cm tall is 180 cm.

No, the heights of opponents cannot be predicted using the heights of presidents.

Step by step solution

01

Given information

Values are given on two variables namely, the president’s height and the opponent’s height.

02

Calculate the mean values

Let x represent thepresident’s height.

Let y represent theopponent’s height.

Themean value of xis given as,

\(\begin{array}{c}\bar x = \frac{{\sum\limits_{i = 1}^n {{x_i}} }}{n}\\ = \frac{{178 + 182 + .... + 188}}{{14}}\\ = 183.429\end{array}\)

Therefore, the mean value of x is 183.429.

Themean value of yis given as,

\(\begin{array}{c}\bar y = \frac{{\sum\limits_{i = 1}^n {{y_i}} }}{n}\\ = \frac{{180 + 180 + .... + 175}}{{14}}\\ = 179.714\end{array}\)

Therefore, the mean value of y is 179.714.

03

Calculate the standard deviation of x and y

The standard deviation of x is given as,

\(\begin{array}{c}{s_x} = \sqrt {\frac{{\sum\limits_{i = 1}^n {{{({x_i} - \bar x)}^2}} }}{{n - 1}}} \\ = \sqrt {\frac{{{{\left( {178 - 183.429} \right)}^2} + {{\left( {182 - 183.429} \right)}^2} + ..... + {{\left( {188 - 183.429} \right)}^2}}}{{14 - 1}}} \\ = 5.003\end{array}\)

Therefore, the standard deviation of x is 5.003.

The standard deviation of y is given as,

\(\begin{array}{c}{s_y} = \sqrt {\frac{{\sum\limits_{i = 1}^n {{{({y_i} - \bar y)}^2}} }}{{n - 1}}} \\ = \sqrt {\frac{{{{\left( {180 - 179.714} \right)}^2} + {{\left( {180 - 179.714} \right)}^2} + ..... + {{\left( {175 - 179.714} \right)}^2}}}{{14 - 1}}} \\ = 4.304\end{array}\)

Therefore, the standard deviation of y is 4.304.

04

Calculate the correlation coefficient

Thecorrelation coefficient is given as,

\(r = \frac{{n\left( {\sum {xy} } \right) - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {\left( {\left( {n\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \right)\left( {\left( {n\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} \right)} }}\)

The calculations required to compute the correlation coefficient are as follows:

The correlation coefficient is given as,

\(\begin{array}{c}r = \frac{{n\left( {\sum {xy} } \right) - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {\left( {\left( {n\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \right)\left( {\left( {n\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} \right)} }}\\ = \frac{{14\left( {461538} \right) - \left( {2568} \right)\left( {2516} \right)}}{{\sqrt {\left( {\left( {14 \times 471370} \right) - {{\left( {2568} \right)}^2}} \right)\left( {\left( {14 \times 452402} \right) - {{\left( {2516} \right)}^2}} \right)} }}\\ = 0.1133\end{array}\)

Therefore, the correlation coefficient is 0.1133.

05

Calculate the slope of the regression line

The slopeof the regression line is given as,

\(\begin{array}{c}{b_1} = r\frac{{{s_Y}}}{{{s_X}}}\\ = 0.1133 \times \frac{{4.304}}{{5.003}}\\ = 0.097\end{array}\)

Therefore, the value of slope is 0.097.

06

Calculate the intercept of the regression line

The interceptis computed as,

\(\begin{array}{c}{b_0} = \bar y - {b_1}\bar x\\ = 179.714 - \left( {0.1133 \times 183.429} \right)\\ = 161.838\end{array}\)

Therefore, the value of intercept is 161.838.

07

Form a regression equation

Theregression equationis given as,

\(\begin{array}{c}\hat y = {b_0} + {b_1}x\\ = 161.838 + 0.097x\end{array}\)

Thus, the regression equation is \(\hat y = 161.838 + 0.097x\).

08

Analyze the regression model

Referring to exercise 26 of section 10-1,

1)The scatter plot does not show a linear relationship between the variables.

2)The P-value is 0.700.

As the P-value is greater than the level of significance (0.05), this implies the null hypothesis fails to reject.

Therefore, the correlation is not significant.

Referring to figure 10-5,the criteria for a good regression model are not satisfied.

Therefore, the regression equation cannot be used to predict the value of y.

The best predicted height of an opponent of a president who is 190 cm tall is computed as,

\(\begin{array}{c}\hat y = \bar y\\ = 179.71\end{array}\)

Therefore, the best predicted height of an opponent of a president who is 190 cm tall is 180 cm.

09

Discuss the prediction methods

No, the heights of opponents cannot be predicted from the heights of the presidents using the regression equation as the regression model is not good due to insignificant correlation.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Outlier Refer to the accompanying Minitab-generated scatterplot. a. Examine the pattern of all 10 points and subjectively determine whether there appears to be a correlation between x and y. b. After identifying the 10 pairs of coordinates corresponding to the 10 points, find the value of the correlation coefficient r and determine whether there is a linear correlation. c. Now remove the point with coordinates (10, 10) and repeat parts (a) and (b). d. What do you conclude about the possible effect from a single pair of values?

Explore! Exercises 9 and 10 provide two data sets from “Graphs in Statistical Analysis,” by F. J. Anscombe, the American Statistician, Vol. 27. For each exercise,

a. Construct a scatterplot.

b. Find the value of the linear correlation coefficient r, then determine whether there is sufficient evidence to support the claim of a linear correlation between the two variables.

c. Identify the feature of the data that would be missed if part (b) was completed without constructing the scatterplot.

x

10

8

13

9

11

14

6

4

12

7

5

y

9.14

8.14

8.74

8.77

9.26

8.10

6.13

3.10

9.13

7.26

4.74

Critical Thinking: Is the pain medicine Duragesic effective in reducing pain? Listed below are measures of pain intensity before and after using the drug Duragesic (fentanyl) (based on data from Janssen Pharmaceutical Products, L.P.). The data are listed in order by row, and corresponding measures are from the same subject before and after treatment. For example, the first subject had a measure of 1.2 before treatment and a measure of 0.4 after treatment. Each pair of measurements is from one subject, and the intensity of pain was measured using the standard visual analog score. A higher score corresponds to higher pain intensity.

Pain Intensity Before Duragesic Treatment

1.2

1.3

1.5

1.6

8

3.4

3.5

2.8

2.6

2.2

3

7.1

2.3

2.1

3.4

6.4

5

4.2

2.8

3.9

5.2

6.9

6.9

5

5.5

6

5.5

8.6

9.4

10

7.6










Pain Intensity After Duragesic Treatment

0.4

1.4

1.8

2.9

6

1.4

0.7

3.9

0.9

1.8

0.9

9.3

8

6.8

2.3

0.4

0.7

1.2

4.5

2

1.6

2

2

6.8

6.6

4.1

4.6

2.9

5.4

4.8

4.1










Correlation Use the given data to construct a scatterplot, then use the methods of Section 10-1 to test for a linear correlation between the pain intensity before and after treatment. If there does appear to be a linear correlation, can we conclude that the drug treatment is effective?

Time and Motion In a physics experiment at Doane College, a soccer ball was thrown upward from the bed of a moving truck. The table below lists the time (sec) that has lapsed from the throw and the height (m) of the soccer ball. What do you conclude about the relationship between time and height? What horrible mistake would be easy to make if the analysis is conducted without a scatterplot?

Time (sec)

0.0

0.2

0.4

0.6

0.8

1.0

1.2

1.4

1.6

1.8

Height (m)

0.0

1.7

3.1

3.9

4.5

4.7

4.6

4.1

3.3

2.1

Interpreting\({R^2}\)In Exercise 2, the quadratic model results in = 0.255. Identify the percentage of the variation in Super Bowl points that can be explained by the quadratic model relating the variable of year and the variable of points scored. (Hint: See Example 2.) What does the result suggest about the usefulness of the quadratic model?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free