Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Exercises 13–28 use the same data sets as Exercises 13–28 in Section 10-1. In each case, find the regression equation, letting the first variable be the predictor (x) variable. Find the indicated predicted value by following the prediction procedure summarized in Figure 10-5 on page 493.

Using the listed duration and interval after times, find the best predicted “interval after” time for an eruption with a duration of 253 seconds. How does it compare to an actual eruption with a duration of 253 seconds and an interval after time of 83 minutes?

Short Answer

Expert verified

The regression equation is\(\hat y = 90.190 + 0.007X\)

The best predicted ‘interval after’ time for an eruption with a duration of 253 seconds will be approximately 92 (minutes).

There is an error of approximately 9 minutes in prediction. This is because the actual eruption with a duration of 253 seconds and an ‘interval after’ time is 83 minutes.

Step by step solution

01

Given information

The given data render the ‘interval after’ (in minutes) and eruption with the duration as follows.

The actual eruption with a duration of 253 seconds has an ‘interval after’ value of 83 minutes.

02

State the estimated regression line

The formula for the estimated regression line is

\(y = {b_0} + {b_1}x\),

where

\({b_0}\)is the Y-intercept,

\({b_1}\)is the slope,

\(x\)is the explanatory variable, and

\(\hat y\)is the response variable (predicted value).

Let X denote the duration (in seconds), and Y denote the ‘interval after’ (in minutes).

03

Compute the slope and intercept

The calculations required to compute the slope and intercept are as follows.

The sample size \(\left( n \right) = 7\).

The slope is computed as

\(\begin{array}{c}{b_1} = \frac{{n\left( {\sum {xy} } \right) - \left( {\sum x } \right)\left( {\sum y } \right)}}{{n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}}}\\ = \frac{{7 \times 145348 - 1584 \times 642}}{{7 \times 369212 - {{1584}^2}}}\\ = 0.006735\\ \approx 0.00673\end{array}\).

The intercept is computed as

\(\begin{array}{c}{b_0} = \frac{{\left( {\sum y } \right)\left( {\sum {{x^2}} } \right) - \left( {\sum x } \right)\left( {\sum {xy} } \right)}}{{n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}}}\\ = \frac{{642 \times 369212 - 1584 \times 145348}}{{7 \times 369212 - {{1584}^2}}}\\ = 90.19027\\ \approx 90.190\end{array}\).

So, the estimated regression equation is

\(\begin{array}{c}\hat y = {b_0} + {b_1}x\\ = 90.2 + 0.00673x\end{array}\)

04

Check the model

Refer to exercise 21 of section 10-1 for the following result.

1) The scatter plot does not show an approximate linear relationship between the variables.

2)The P-value is 0.921.

As theP-value is greater than the level of significance (0.05), the null hypothesis fails to be rejected.

Therefore, the correlation is not significant.

Referring to figure 10-5, the criteria for a good regression model are not satisfied.

As the model is bad, the best-predicted value of a variable is its sample mean.

05

Compute the predicted value

The best-predicted interval after times for an eruption with a duration of 253 seconds is required to be obtained.

As this is a bad model, the sample mean of the response variable will be used to predict the value.

The sample meanfor the response variable is

\(\begin{array}{c}\bar y = \frac{{\sum {{y_i}} }}{n}\\ = \frac{{91 + 81 + ... + 91}}{7}\\ = 91.7143\end{array}\).

Therefore, the best-predicted interval after time for an eruption with a duration of 253 seconds will be approximately 91.7 minutes.

Thus, the actual value of 83 minutes differs significantly from 91.7 minutes.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

The following exercises are based on the following sample data consisting of numbers of enrolled students (in thousands) and numbers of burglaries for randomly selected large colleges in a recent year (based on data from the New York Times).

The sample data result in a linear correlation coefficient of r= 0.499 and the regression equation\(\hat y = 3.83 + 2.39x\). What is the best predicted number of burglaries, given an enrollment of 50 (thousand), and how was it found?

Interpreting r. In Exercises 5–8, use a significance level of A = 0.05 and refer to the accompanying displays.

5. Bear Weight and Chest Size Fifty-four wild bears were anesthetized, and then their weights and chest sizes were measured and listed in Data Set 9 “Bear Measurements” in Appendix B; results are shown in the accompanying Statdisk display. Is there sufficient evidence to support the claim that there is a linear correlation between the weights of bears and their chest sizes? When measuring an anesthetized bear, is it easier to measure chest size than weight? If so, does it appear that a measured chest size can be used to predict the weight?

In Exercises 9–12, refer to the accompanying table, which was obtained using the data from 21 cars listed in Data Set 20 “Car Measurements” in Appendix B. The response (y) variable is CITY (fuel consumption in mi, gal). The predictor (x) variables are WT (weight in pounds), DISP (engine displacement in liters), and HWY (highway fuel consumption in mi, gal).

A Honda Civic weighs 2740 lb, it has an engine displacement of 1.8 L, and its highway fuel consumption is 36 mi/gal. What is the best predicted value of the city fuel consumption? Is that predicted value likely to be a good estimate? Is that predicted value likely to be very accurate?

Interpreting the Coefficient of Determination. In Exercises 5–8, use the value of the linear correlation coefficient r to find the coefficient of determination and the percentage of the total variation that can be explained by the linear relationship between the two variables.

Pizza and Subways r = 0.992 (x = cost of a slice of pizza, y = subway fare in New York City

Finding a Prediction Interval. In Exercises 13–16, use the paired data consisting of registered Florida boats (tens of thousands) and manatee fatalities from boat encounters listed in Data Set 10 “Manatee Deaths” in Appendix B. Let x represent number of registered boats and let y represent the corresponding number of manatee deaths. Use the given number of registered boats and the given confidence level to construct a prediction interval estimate of manatee deaths.

Boats Use x = 85 (for 850,000 registered boats) with a 99% confidence level.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free