Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

In Exercises 9 and 10, use the given data to find the equation of the regression line. Examine the scatterplot and identify a characteristic of the data that is ignored by the regression line.

Short Answer

Expert verified

The regression equation is \(\hat y = 3.00 + 0.500x\).

The scatterplot is:

There exists an outlier in the data.

Step by step solution

01

Given information

Values are given on two variables namely, x and y.

02

Calculate the mean of x and y

Themean value of xis given as,

\(\begin{array}{c}\bar x = \frac{{\sum\limits_{i = 1}^n {{x_i}} }}{n}\\ = \frac{{10 + 8 + .... + 5}}{{11}}\\ = 9\end{array}\)

Therefore, the mean value of x is 9.

Themean value of yis given as,

\(\begin{array}{c}\bar y = \frac{{\sum\limits_{i = 1}^n {{y_i}} }}{n}\\ = \frac{{7.46 + 6.77 + .... + 5.73}}{{11}}\\ = 7.5\end{array}\)

Therefore, the mean value of y is 7.5

03

Calculate the standard deviation of x and y

The standard deviation of x is given as,

\(\begin{array}{c}{s_x} = \sqrt {\frac{{\sum\limits_{i = 1}^n {{{({x_i} - \bar x)}^2}} }}{{n - 1}}} \\ = \sqrt {\frac{{{{\left( {10 - 9} \right)}^2} + {{\left( {8 - 9} \right)}^2} + ... + {{\left( {5 - 9} \right)}^2}}}{{11 - 1}}} \\ = 3.3166\end{array}\)

Therefore, the standard deviation of x is 3.3166.

The standard deviation of y is given as,

\(\begin{array}{c}{s_y} = \sqrt {\frac{{\sum\limits_{i = 1}^n {{{({y_i} - \bar y)}^2}} }}{{n - 1}}} \\ = \sqrt {\frac{{{{\left( {7.46 - 7.5} \right)}^2} + {{\left( {6.77 - 7.5} \right)}^2} + ..... + {{\left( {5.73 - 7.5} \right)}^2}}}{{11 - 1}}} \\ = 2.0304\end{array}\)

Therefore, the standard deviation of y is 2.0304.

04

Calculate the correlation coefficient

The correlation coefficient is given as,

\(r = \frac{{n\left( {\sum {xy} } \right) - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {\left( {\left( {n\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \right)\left( {\left( {n\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} \right)} }}\)

The calculations required to compute the correlation coefficient are as follows:

The correlation coefficient is given as,

\(\begin{array}{l}r = \frac{{n\left( {\sum {xy} } \right) - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {\left( {\left( {n\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \right)\left( {\left( {n\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} \right)} }}\\ = \frac{{11\left( {797.47} \right) - \left( {99} \right)\left( {82.5} \right)}}{{\sqrt {\left( {\left( {11 \times 1001} \right) - {{\left( {99} \right)}^2}} \right)\left( {\left( {11 \times 659.9762} \right) - {{\left( {82.5} \right)}^2}} \right)} }}\\ = 0.8163\end{array}\)

Therefore, the correlation coefficient is 0.8163.

05

Calculate the slope of the regression line

The slopeof the regression line is given as,

\(\begin{array}{c}{b_1} = r\frac{{{s_Y}}}{{{s_X}}}\\ = 0.8163 \times \frac{{2.030}}{{3.317}}\\ = 0.4996\\ \approx 0.500\end{array}\)

Therefore, the value of slope is 0.500.

06

Calculate the intercept of the regression line

The interceptis computed as,

\(\begin{array}{c}{b_0} = \bar y - {b_1}\bar x\\ = 7.5 - \left( {0.500 \times 9} \right)\\ = 3.002\end{array}\)

Therefore, the value of intercept is 3.00.

07

Form a regression equation

Theregression equationis given as,

\(\begin{array}{c}\hat y = {b_0} + {b_1}x\\ = 3.002 + 0.500x\end{array}\)

Thus, the regression equation is \(\hat y = 3.002 + 0.500x\).

08

Construct a scatter plot

Use the following steps to plot a scatter plot between x and y:

  • Consider x and y.
  • Mark the values 0, 2, and so on until 14 on the vertical axis.
  • Mark the values 0, 5, and so on until 15 on the horizontal axis.
  • Plot the points on the graph corresponding to the pairs of values for the two variables.
  • Label the horizontal axis as “y” and the vertical axis as “x”.

The following scatterplot is generated:

09

State the characteristic ignored in the data

It can be observed from the above scatter plot that an observation (13,12.74) is extreme and deviates largely from a straight line pattern. Thus the characteristic that is been ignored is thatthere exists an outlier at (13,12.74).

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Revised mpg Ratings Listed below are combined city-highway fuel economy ratings (in mi>gal) for different cars. The old ratings are based on tests used before 2008 and the new ratings are based on tests that went into effect in 2008. Is there sufficient evidence to conclude that there is a linear correlation between the old ratings and the new ratings? What do the data suggest about the old ratings?

Old

16

27

17

33

28

24

18

22

20

29

21

New

15

24

15

29

25

22

16

20

18

26

19

Exercises 13–28 use the same data sets as Exercises 13–28 in Section 10-1. In each case, find the regression equation, letting the first variable be the predictor (x) variable. Find the indicated predicted value by following the prediction procedure summarized in Figure 10-5 on page 493.

Using the listed duration and interval after times, find the best predicted “interval after” time for an eruption with a duration of 253 seconds. How does it compare to an actual eruption with a duration of 253 seconds and an interval after time of 83 minutes?

let the predictor variable x be the first variable given. Use the given data to find the regression equation and the best predicted value of the response variable. Be sure to follow the prediction procedure summarized in Figure 10-5 on page 493. Use a 0.05 significance level.

Head widths (in.) and weights (lb) were measured for 20 randomly selected bears (from Data Set 9 “Bear Measurements” in Appendix B). The 20 pairs of measurements yield\(\bar x = 6.9\)in.,\(\bar y = 214.3\)lb, r= 0.879, P-value = 0.000, and\(\hat y = - 212 + 61.9x\). Find the best predicted value of\(\hat y\)(weight) given a bear with a head width of 6.5 in.

Confidence Intervals for a Regression Coefficients A confidence interval for the regression coefficient b1 is expressed

\(\begin{array}{l}{b_1} - E < {\beta _1} < {b_1} + E\\\end{array}\)

Where

\(E = {t_{\frac{\alpha }{2}}}{s_{{b_1}}}\)

The critical t score is found using n –(k+1) degrees of freedom, where k, n, and sb1 are described in Exercise 17. Using the sample data from Example 1, n = 153 and k = 2, so df = 150 and the critical t scores are \( \pm \)1.976 for a 95% confidence level. Use the sample data for Example 1, the Stat diskdisplay in Example 1 on page 513, and the Stat Crunchdisplay in Exercise 17 to construct 95% confidence interval estimates of \({\beta _1}\) (the coefficient for the variable representing height) and\({\beta _2}\) (the coefficient for the variable representing waist circumference). Does either confidence interval include 0, suggesting that the variable be eliminated from the regression equation?

Finding Critical r Values Table A-6 lists critical values of r for selected values of n and a. More generally, critical r values can be found by using the formula

\(r = \frac{t}{{\sqrt {{t^2} + n - 2} }}\)

where the t value is found from the table of critical t values (Table A-3) assuming a two-tailed case with n - 2 degrees of freedom. Use the formula for r given here and in Table A-3 (with n - 2 degrees of freedom) to find the critical r values corresponding to \({H_1}:\rho \ne 0\), \(\alpha \)= 0.02, and n = 27.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free