Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

T12.12 Foresters are interested in predicting the amount of usable lumber they can harvest from various tree species. They collect data on the diameter at breast height (DBH) in inches and the yield in board feet of a random sample of 20 Ponderosa pine trees that have been harvested. (Note that a board foot is defined as a piece of lumber 12 inches by 12 inches by 1 inch.) Here is a scatterplot of the data.

a. Here is some computer output and a residual plot from a least-squares regression on these data. Explain why a linear model may not be appropriate in this case.

The foresters are considering two possible transformations of the original data: (1) cubing the diameter values or (2) taking the natural logarithm of the yield measurements. After transforming the data, a least-squares regression analysis is performed. Here is some computer output and a residual plot for each of the two possible regression models:

b. Use both models to predict the amount of usable lumber from a Ponderosa pine with diameter 30 inches.
c. Which of the predictions in part (b) seems more reliable? Give appropriate evidence to support your choice.

Short Answer

Expert verified

(a) The pattern in the residual plot involves substantial curvature, a linear model will not be appropriate because the variables have a curved connection.

(b) The predicted yield for option 1is 117.0899board feet and the predicted yield for option 2 is 102.967board feet.

(c) Option 1 is the better option for prediction.

Step by step solution

01

Part (a) Step 1: Given information

To determine that a linear model may not be appropriate in this case.

02

Part (a) Step 2: Explanation

Foresters want to know how much useful lumber they'll be able to get from different tree species.
They took measurements of a random sample of Ponderosa pine trees' diameter at breast height in inches and yield in broad feet.
In the question, will find the computer output as well as a residual plot from least square regression.
Because the pattern in the residual plot involves substantial curvature, a linear model will not be acceptable because the variables have a curved connection.

03

Part (b) Step 1: Given information

To use both models to predict the amount of usable lumber from a Ponderosa pine with diameter 30 inches.

04

Part (b) Step 2: Explanation

Foresters want to know how much useful lumber they will be able to collect from different tree types. They measured the diameter of a random sample of Ponderosa pine trees at breast height in inches and the yield in broad feet. The question includes the computer results as well as a residual graphic from a least square regression. The foresters are exploring cubing the diameter values or taking the natural logarithm of the yields measurements as two feasible modifications of the original data.
As a result, the general equation of the least square regression line for option 1 is:
y^=b0+b1x
The value of the constant b0 is calculated as follows in the computer output's row "Constant" and column "Coef":
b0=2.078
The value of the constant b1is calculated as follows in the computer output's row "DBH3" and column "Coef":
b1=0.0042597

In the general equation, replace b0with 2.078and b1with b1=0.0042597.
y^=b0+b1x
y^=2.078+0.0042597x

Hence the cubic equation is calculated as:

y^=2.078+0.0042597x3

Substitute xfor 30:

y^=2.078+0.0042597x3

=2.078+0.0042597(30)3

=117.0899

As a result, the predicted yield is 117.0899board feet.

05

Part (b) Step 3: Explanation

Then, the general equation of the least square regression line for option 2 is:
y^=b0+b1x
The value of the constant $b 0$ is calculated as follows in the computer output's row "Constant" and column "Coef":
b0=1.2319

The value of the constant $b 1$ is calculated as follows in the computer output's row "DBH" and column "Coef":
b1=0.113417
In the general equation, replace $b_{0}=1.2319$ and $b_{1}$ with $b_{1}=0.113417$,
y^=b0+b1x
y^=1.2319+0.113417x
Use the logarithm in the equation:
lny^=1.2319+0.113417x
Then multiply xby 30 to get:
lny^=1.2319+0.113417x
=1.2319+0.113417(30)
=4.63441
Take each side's exponential:
y^=elny^
=e4.63441
=102.967

As a result, the predicted yield is 102.967 board feet.

06

Part (c)  Step 1: Given information

To find the predictions in part (b) seems more reliable and to explain with appropriate evidence.

07

Part (c) Step 2: Explanation

Foresters want to know how much useful lumber they will be able to collect from different tree types. They measured the diameter of a random sample of Ponderosa pine trees at breast height in inches and the yield in broad feet.
The question includes the computer results as well as a residual graphic from a least square regression. The foresters are exploring cubing the diameter values or taking the natural logarithm of the yields measurements as two feasible modifications of the original data.
As a result, the residual plot of option 1 has no strong curvature, whereas the residual plot of option 2 has strong curvature.
Also, the model in option 1is appropriate for making predictions, but the model in option 2 is not.
Therefore, estimated that forecast using option 1 will be more accurate.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Sam has determined that the weights of unpeeled bananas from his local store have a mean of116grams with a standard deviation of 9grams. Assuming that the distribution of weight is approximately Normal, to the nearest gram, the heaviest 30%of these bananas weigh at least how much?

a.107g

b.121g

C.111g

d.125g

e.116g

Multiple Choice Select the best answer for Exercises 23-28. Exercises 23-28 refer to the following setting. To see if students with longer feet tend to be taller, a random sample of 25students was selected from a large high school. For each student, x=footlength&y=heightwere recorded. We checked that the conditions for inference about the slope of the population regression line are met. Here is a portion of the computer output from a least-squares regression analysis using these data:

Which of the following is a 95%confidence interval for the population slope β1?

a.3.0867±0.4117

b. 3.0867±0.8518

c.3.0867±0.8069

d.3.0867±0.8497

e.localid="1654193042763" 3.0867±0.8481

Prey attracts predators . Here is computer output from the least-squares regression analysis of the perch data

a. What is the estimate for β0? Interpret this value.

b. What is the estimate for β1? Interpret this value.

c. What is the estimate for σ? Interpret this value.

d. Give the standard error of the slope SEb1. Interpret this value.

Beer and BAC Refer to Exercise 5. Here is computer output from the least-squares regression analysis of the beer and blood alcohol data.

a. What is the estimate for β0? Interpret this value.

b. What is the estimate for β1? Interpret this value.

c. What is the estimate for σ? Interpret this value.

d. Give the standard error of the slope SEb1. Interpret this value.

Beer and BAC How well does the number of beers a person drinks predict his or her blood alcohol content (BAC)? Sixteen volunteers aged 21or older with an initial BAC of 0took part in a study to find out. Each volunteer drank a randomly assigned number of cans of beer. Thirty minutes later, a police officer measured their BAC. A least-squares regression analysis was performed on the data using x=number of beers and y=BAC. Here is a residual plot and a histogram of the residuals. Check whether the conditions for performing inference about the regression model are met.

a. Find the critical value for a 99%confidence interval for the slope of the true regression line. Then calculate the confidence interval.

b. Interpret the interval from part (a).

c. Explain the meaning of “localid="1654184305701" 99%confident” in this context

Here is computer output from the least-squares regression analysis of the beer and blood alcohol dat

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free