Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Regression and Predictions. Exercises 13–28 use the same data sets as Exercises 13–28 in Section 10-1. In each case, find the regression equation, letting the first variable be the predictor (x) variable. Find the indicated predicted value by following the prediction procedure summarized in Figure 10-5 on page 493.

Use the listed boat/manatee data. In a year not included in the data below, there were 970,000 registered pleasure boats in Florida. Find the best predicted number of manatee fatalities resulting from encounters with boats. Is the result reasonably close to 79, which was the actual number of manatee fatalities?

Pleasure Boats

99

99

97

95

90

90

87

90

90

Manatee Fatalities

92

73

90

97

83

88

81

73

68

Short Answer

Expert verified

The regression equation is\(\hat y = 13.6 + 0.744x\).

Thebest predicted number of manatee fatalities resulting from encounters with boatsis 83.

The prediction is not reasonably close to 79.

Step by step solution

01

Given information

Values are given on two variables namely, pleasure boats and manatee fatalities.

02

Calculate the mean values

Let x represent thepleasure boats.

Let y represent themanatee fatalities.

Themean value of xis given as,

\(\begin{array}{c}\bar x = \frac{{\sum\limits_{i = 1}^n {{x_i}} }}{n}\\ = \frac{{99 + 99 + .... + 90}}{9}\\ = 93\end{array}\)

Therefore, the mean value of x is 93.

Themean value of yis given as,

\(\begin{array}{c}\bar y = \frac{{\sum\limits_{i = 1}^n {{y_i}} }}{n}\\ = \frac{{92 + 73 + .... + 68}}{9}\\ = 82.778\end{array}\)

Therefore, the mean value of y is 82.778.

03

Calculate the standard deviation of x and y

The standard deviation of xis given as,

\(\begin{array}{c}{s_x} = \sqrt {\frac{{\sum\limits_{i = 1}^n {{{({x_i} - \bar x)}^2}} }}{{n - 1}}} \\ = \sqrt {\frac{{{{\left( {99 - 93} \right)}^2} + {{\left( {99 - 93} \right)}^2} + ..... + {{\left( {90 - 93} \right)}^2}}}{{9 - 1}}} \\ = 4.528\end{array}\)

Therefore, the standard deviation of x is 4.528.

The standard deviation of y is given as,

\(\begin{array}{c}{s_y} = \sqrt {\frac{{\sum\limits_{i = 1}^n {{{({y_i} - \bar y)}^2}} }}{{n - 1}}} \\ = \sqrt {\frac{{{{\left( {92 - 82.778} \right)}^2} + {{\left( {73 - 82.778} \right)}^2} + ..... + {{\left( {68 - 82.778} \right)}^2}}}{{9 - 1}}} \\ = 9.871\end{array}\)

Therefore, the standard deviation of y is 9.871.

04

Calculate the correlation coefficient

The correlation coefficient is given as,

\(r = \frac{{n\left( {\sum {xy} } \right) - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {\left( {\left( {n\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \right)\left( {\left( {n\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} \right)} }}\)

The calculations required to compute the correlation coefficient are as follows,

The correlation coefficient is given as,

\(\begin{array}{c}r = \frac{{n\left( {\sum {xy} } \right) - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {\left( {\left( {n\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \right)\left( {\left( {n\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} \right)} }}\\ = \frac{{9\left( {69407} \right) - \left( {837} \right)\left( {745} \right)}}{{\sqrt {\left( {\left( {9 \times 78005} \right) - {{\left( {837} \right)}^2}} \right)\left( {\left( {9 \times 62449} \right) - {{\left( {745} \right)}^2}} \right)} }}\\ = 0.3412\end{array}\)

Therefore, the correlation coefficient is 0.3412.

05

Calculate the slope of the regression line

The slopeof the regression line is given as,

\(\begin{array}{c}{b_1} = r\frac{{{s_Y}}}{{{s_X}}}\\ = 0.3412 \times \frac{{9.871}}{{4.528}}\\ = 0.744\end{array}\)

Therefore, the value of slope is 0.744.

06

Calculate the intercept of the regression line

The interceptis computed as,

\(\begin{array}{c}{b_0} = \bar y - {b_1}\bar x\\ = 82.778 - \left( {0.744 \times 93} \right)\\ = 13.586\\ \approx 13.6\end{array}\)

Therefore, the value of intercept is 13.6.

07

Form a regression equation

Theregression equation is given as,

\(\begin{array}{c}\hat y = {b_0} + {b_1}x\\ = 13.6 + 0.744x\end{array}\)

Thus, the regression equation is \(\hat y = 13.6 + 0.744x\).

08

Analyze the model

Referring to exercise 24 of section 10-1,

1)The scatter plot does not show a linear relationship between the variables.

2)The P-value is 0.369.

As the P-value is greater than the level of significance (0.05), this implies the null hypothesis fails to be rejected.

Therefore, the correlation is not significant.

Referring to figure 10-5, thecriteria for a good regression model are not satisfied.

Therefore, the regression equation cannot be used to predict the value of y.

09

Predict the value

The best predicted number of manatee fatalities resulting from encounters with boatsis the mean number ofManatee Fatalities; that is 82.778 approximately equal to 83.

Therefore, thebest predicted number of manatee fatalities resulting from encounters with boatsis 83.

The prediction value is 83 which is not reasonably close to 79.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

In Exercises 5–8, we want to consider the correlation between heights of fathers and mothers and the heights of their sons. Refer to the

StatCrunch display and answer the given questions or identify the indicated items.

The display is based on Data Set 5 “Family Heights” in Appendix B.

A son will be born to a father who is 70 in. tall and a mother who is 60 in. tall. Use the multiple regression equation to predict the height of the son. Is the result likely to be a good predicted value? Why or why not?

Cigarette Tar and Nicotine The table below lists measured amounts (mg) of tar, carbonmonoxide (CO), and nicotine in king size cigarettes of different brands (from Data Set 13“Cigarette Contents” in Appendix B).

a. Is there is sufficient evidence to support a claim of a linear correlation between tar and nicotine?

b. What percentage of the variation in nicotine can be explained by the linear correlation between nicotine and tar?

c. Letting yrepresent the amount of nicotine and letting xrepresent the amount of tar, identify the regression equation.

d. The Raleigh brand king size cigarette is not included in the table, and it has 23 mg of tar. What is the best predicted amount of nicotine? How does the predicted amount compare to the actual amount of 1.3 mg of nicotine?

Tar

25

27

20

24

20

20

21

24

CO

18

16

16

16

16

16

14

17

Nicotine

1.5

1.7

1.1

1.6

1.1

1.0

1.2

1.4

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

POTUS Media periodically discuss the issue of heights of winning presidential candidates and heights of their main opponents. Listed below are those heights (cm) from severalrecent presidential elections (from Data Set 15 “Presidents” in Appendix B). Is there sufficient evidence to conclude that there is a linear correlation between heights of winning presidential candidates and heights of their main opponents? Should there be such a correlation?

President

178

182

188

175

179

183

192

182

177

185

188

188

183

188

Opponent

180

180

182

173

178

182

180

180

183

177

173

188

185

175

In Exercises 9–12, refer to the accompanying table, which was obtained using the data from 21 cars listed in Data Set 20 “Car Measurements” in Appendix B. The response (y) variable is CITY (fuel consumption in mi , gal). The predictor (x) variables are WT (weight in pounds), DISP (engine displacement in liters), and HWY (highway fuel consumption in mi , gal).

If exactly two predictor (x) variables are to be used to predict the city fuel consumption, which two variables should be chosen? Why?

Cell Phones and Driving In the author’s home town of Madison, CT, there were 2733 police traffic stops in a recent year, and 7% of them were attributable to improper use of cell phones. Use a 0.05 significance level to test the claim that the sample is from a population in which fewer than 10% of police traffic stops are attributable to improper cell phone use.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free