Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Variation and Prediction Intervals. In Exercises 17–20, find the (a) explained variation, (b) unexplained variation, and (c) indicated prediction interval. In each case, there is sufficient evidence to support a claim of a linear correlation, so it is reasonable to use the regression equation when making predictions.

Altitude and Temperature Listed below are altitudes (thousands of feet) and outside air temperatures (°F) recorded by the author during Delta Flight 1053 from New Orleans to Atlanta. For the prediction interval, use a 95% confidence level with the altitude of 6327 ft (or 6.327 thousand feet).

Altitude (thousands of feet)

3

10

14

22

28

31

33

Temperature (°F)

57

37

24

-5

-30

-41

-54

Short Answer

Expert verified

a)Explained variation:10626.59

(b) Unexplained variation:68.83577

(c) 95%prediction interval: (38.0,60.4)

Step by step solution

01

Given information

Data are given on two variables “Altitude (thousands of feet)” and “Temperature (in degrees Fahrenheit).”

02

Obtain the regression equation

Let x denote the variable “Altitude.”

Let y denote the variable “Temperature.”

The regression equation of y on x has the following notation:

\(\hat y = {b_0} + {b_1}x\),

where

\({b_0}\)is the intercept term and

\({b_1}\)is the slope coefficient.

The following calculations are done to compute the intercept and the slope coefficient:

The y-intercept is computed below:

\(\begin{array}{c}{b_0} = \frac{{\left( {\sum y } \right)\left( {\sum {{x^2}} } \right) - \left( {\sum x } \right)\left( {\sum {xy} } \right)}}{{n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}}}\\ = \frac{{\left( { - 12} \right)\left( {3623} \right) - \left( {141} \right)\left( { - 3126} \right)}}{{7\left( {3623} \right) - {{\left( {141} \right)}^2}}}\\ = 72.49817518\end{array}\)

The slope coefficient is computed below:

\(\begin{array}{c}{b_1} = \frac{{n\left( {\sum {xy} } \right) - \left( {\sum x } \right)\left( {\sum y } \right)}}{{n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}}}\\ = \frac{{\left( 7 \right)\left( { - 3126} \right) - \left( {141} \right)\left( { - 12} \right)}}{{7\left( {3623} \right) - {{\left( {141} \right)}^2}}}\\ = - 3.68430656\end{array}\)

Thus, the regression equation becomes

\(\hat y = 72.49817518 - 3.68430656x\).

03

Calculate the explained and unexplained variations

The following table shows the predicted values (upon substituting the values of x in the regression equation):

The mean value of observed y is computed below:

\(\begin{array}{c}\bar y = \frac{{\sum y }}{n}\\ = \frac{{ - 12}}{7}\\ = - 1.71429\end{array}\)

Some important calculations are done below:

The explained variation is \(\sum {{{\left( {\hat y - \bar y} \right)}^2}} = 10626.59\).

Thus, the explained variation is equal to 10626.59.

The unexplained variation is \(\sum {{{\left( {y - \hat y} \right)}^2}} = 68.83577\).

Thus, the unexplained variation is equal to 68.83577.

04

Predicted value at \(\left( {{x_0}} \right)\)

Substitute\({x_0} = 6.327\)in the regression equation to get the predicted value.

\(\begin{array}{c}\hat y = 72.49817518 - 3.68430656x\\ = 72.49817518 - 3.68430656\left( {6.327} \right)\\ = 49.18757\\ \approx 49\end{array}\)

05

Formula of prediction interval

The prediction interval is obtained using the formula

\(\begin{array}{c}PI = \hat y \pm E\\ = \hat y \pm {t_{\frac{\alpha }{2}}}{s_e}\sqrt {1 + \frac{1}{n} + \frac{{n{{\left( {{x_0} - \bar x} \right)}^2}}}{{n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}}}} \end{array}\).

06

Degrees of freedom and critical value

The following formula is used to compute the level of significance.

\(\begin{array}{c}{\rm{Confidence}}\;{\rm{Leve}}l = 95\% \\100\left( {1 - \alpha } \right) = 95\\1 - \alpha = 0.95\\\alpha = 0.05\end{array}\)

The degrees of freedom for computing the t-multiplier are shownbelow:

\(\begin{array}{c}df = n - 2\\ = 7 - 2\\ = 5\end{array}\)

The two-tailed value of the t-multiplier for the level of significance (0.05) and degrees of freedom (5) is 2.5706.

07

Standard error of the estimate

The standard error of the estimate is computed below:

\(\begin{array}{c}{s_e} = \sqrt {\frac{{\sum {{{\left( {y - \hat y} \right)}^2}} }}{{n - 2}}} \\ = \sqrt {\frac{{68.83577}}{{7 - 2}}} \\ = 3.710411\end{array}\)

08

Value of \(\bar x\)

The value of \(\bar x\)is computed as follows:

\(\begin{array}{c}\bar x = \frac{{\sum x }}{n}\\ = \frac{{141}}{7}\\ = 20.14286\end{array}\)

09

Calculate the prediction interval

Substitute the values obtained above to calculate the margin of error (E).

\(\begin{array}{c}E = {t_{\frac{\alpha }{2}}}{s_e}\sqrt {1 + \frac{1}{n} + \frac{{n{{\left( {{x_0} - \bar x} \right)}^2}}}{{n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}}}} \\ = \left( {2.5706} \right)\left( {3.710411} \right)\sqrt {1 + \frac{1}{7} + \frac{{7{{\left( {6.327 - 20.14286} \right)}^2}}}{{7\left( {3623} \right) - \left( {19881} \right)}}} \\ = 11.23167664\end{array}\)

Thus, the prediction interval (PI) becomes as shown:

\(\begin{array}{c}PI = \left( {\hat y - E,\hat y + E} \right)\\ = \left( {49.18757 - 11.23167664,49.18757 + 11.23167664} \right)\\ = (37.9559,60.4192)\\ \approx \left( {38.0,60.4} \right)\end{array}\)

Therefore, the 95% prediction interval for the temperature (in degrees Fahrenheit) for the given altitude of 6.327 is (38.0, 60.4).

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Critical Thinking: Is the pain medicine Duragesic effective in reducing pain? Listed below are measures of pain intensity before and after using the drug Duragesic (fentanyl) (based on data from Janssen Pharmaceutical Products, L.P.). The data are listed in order by row, and corresponding measures are from the same subject before and after treatment. For example, the first subject had a measure of 1.2 before treatment and a measure of 0.4 after treatment. Each pair of measurements is from one subject, and the intensity of pain was measured using the standard visual analog score. A higher score corresponds to higher pain intensity.

Pain Intensity Before Duragesic Treatment

1.2

1.3

1.5

1.6

8

3.4

3.5

2.8

2.6

2.2

3

7.1

2.3

2.1

3.4

6.4

5

4.2

2.8

3.9

5.2

6.9

6.9

5

5.5

6

5.5

8.6

9.4

10

7.6










Pain Intensity After Duragesic Treatment

0.4

1.4

1.8

2.9

6

1.4

0.7

3.9

0.9

1.8

0.9

9.3

8

6.8

2.3

0.4

0.7

1.2

4.5

2

1.6

2

2

6.8

6.6

4.1

4.6

2.9

5.4

4.8

4.1










Correlation Use the given data to construct a scatterplot, then use the methods of Section 10-1 to test for a linear correlation between the pain intensity before and after treatment. If there does appear to be a linear correlation, can we conclude that the drug treatment is effective?

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Tips Listed below are amounts of bills for dinner and the amounts of the tips that were left. The data were collected by students of the author. Is there sufficient evidence to conclude that there is a linear correlation between the bill amounts and the tip amounts? If everyone were to tip with the same percentage, what should be the value of r?

Bill(dollars)

33.46

50.68

87.92

98.84

63.6

107.34

Tip(dollars)

5.5

5

8.08

17

12

16

Stocks and Sunspots. Listed below are annual high values of the Dow Jones Industrial Average (DJIA) and annual mean sunspot numbers for eight recent years. Use the data for Exercises 1–5. A sunspot number is a measure of sunspots or groups of sunspots on the surface of the sun. The DJIA is a commonly used index that is a weighted mean calculated from different stock values.

DJIA

14,198

13,338

10,606

11,625

12,929

13,589

16,577

18,054

Sunspot Number

7.5

2.9

3.1

16.5

55.7

57.6

64.7

79.3

z Scores Using only the sunspot numbers, identify the highest number and convert it to a z score. In the context of these sample data, is that highest value “significantly high”? Why or why not?

Cell Phones and Driving In the author’s home town of Madison, CT, there were 2733 police traffic stops in a recent year, and 7% of them were attributable to improper use of cell phones. Use a 0.05 significance level to test the claim that the sample is from a population in which fewer than 10% of police traffic stops are attributable to improper cell phone use.

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Weighing Seals with a Camera Listed below are the overhead widths (cm) of seals

measured from photographs and the weights (kg) of the seals (based on “Mass Estimation of Weddell Seals Using Techniques of Photogrammetry,” by R. Garrott of Montana State University). The purpose of the study was to determine if weights of seals could be determined from overhead photographs. Is there sufficient evidence to conclude that there is a linear correlation between overhead widths of seals from photographs and the weights of the seals?

Overhead Width

7.2

7.4

9.8

9.4

8.8

8.4

Weight

116

154

245

202

200

191

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free