Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

When coastal power stations take in large quantities of cooling water, it is inevitable that a number of fish are drawn in with the water. Various methods have been designed to screen out the fish. The article "Multiple \(\mathrm{Re}-\) gression Analysis for Forecasting Critical Fish Influxes at Power Station Intakes" (Journal of Applied Ecology [1983]: 33-42) examined intake fish catch at an English power plant and several other variables thought to affect fish intake: $$ \begin{aligned} y &=\text { fish intake (number of fish) } \\ x_{1} &=\text { water temperature }\left({ }^{\circ} \mathrm{C}\right) \\ x_{2} &=\text { number of pumps running } \\ x_{3} &=\text { sea state }(\text { values } 0,1,2, \text { or } 3) \\ x_{4} &=\text { speed }(\text { knots }) \end{aligned} $$ Part of the data given in the article were used to obtain the estimated regression equation $$ \hat{y}=92-2.18 x_{1}-19.20 x_{2}-9.38 x_{3}+2.32 x_{4} $$ (based on \(n=26\) ). SSRegr \(=1486.9\) and SSResid = \(2230.2\) were also calculated. a. Interpret the values of \(b_{1}\) and \(b_{4}\). b. What proportion of observed variation in fish intake can be explained by the model relationship? c. Estimate the value of \(\sigma\). d. Calculate adjusted \(R^{2}\). How does it compare to \(R^{2}\) itself?

Short Answer

Expert verified
Part a: \(b_1\) represents a decrease of 2.18 in fish intake for every increase in water temperature while keeping other factors constant, and \(b_4\) represents an increase of 2.32 in fish intake for every increment in speed, keeping other factors constant. Part b: Approximately 40% of the variation in fish intake can be explained by the variables in this regression model. Part c: \(\sigma\) can be estimated using the provided formula and given SSResid value. Part d: Adjusted \(R^{2}\) can be estimated using the given formula and then compared to \(R^{2}\). The detailed values for \(\sigma\) and Adjusted \(R^{2}\) can be calculated using a calculator or software.

Step by step solution

01

Interpretation of \(b_{1}\) and \(b_{4}\) Values

For every unit increase in water temperature (x₁), fish intake (y) decreases by 2.18, assuming all other variables are constant. Similarly, for every unit increase in speed (x₄), fish intake (y) increases by 2.32, assuming all other variables are constant. This tells us about the effects of water temperature and speed on fish intake at the power plant.
02

Calculation of \(R^{2}\) and Interpretation

\(R^{2}\), the coefficient of determination, is calculated from the formula \[R^{2} = \frac{SSRegr}{SSTotal}= \frac{SSRegr}{SSRegr + SSResid} \] So, \(R^{2}\) = 1486.9 / (1486.9 + 2230.2) = 0.3998, or approximately 0.4. This means 40% of the variation in fish intake can be explained by this model.
03

Estimation of \(\sigma\)

The value of \(\sigma\) (standard deviation of the residuals) can be estimated by the formula \[ \sigma = \sqrt{\frac{SSResid}{n-p-1}}\] where, n is the sample size and p is the number of predictors. Thus, \(\sigma\) = \(\sqrt{\frac{2230.2}{26 - 4 - 1}}\). Place these values into a calculator to get an estimation of \(\sigma\).
04

Calculation and Comparison of Adjusted \(R^{2}\)

The adjustment in the \(R^{2}\) value takes the number of predictors into account, which is better for comparison purposes if models have different numbers of predictors. It is calculated as follows: \[Adjusted \ R^{2} = 1 - (1 - R^{2})\frac{n-1}{n-p-1}\] where n is the number of observations and p is the number of predictors. Inserting the known values, the calculation of adjusted \(R^{2}\) can be made, and then it can be compared with \(R^{2}\).

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

The article "The Caseload Controversy and the Study of Criminal Courts" (Journal of Criminal Law and Criminology [1979]: 89-101) used a multiple regression analysis to help assess the impact of judicial caseload on the processing of criminal court cases. Data were collected in the Chicago criminal courts on the following variables: $$ \begin{aligned} y &=\text { number of indictments } \\ x_{1} &=\text { number of cases on the docket } \end{aligned} $$ \(x_{2}=\) number of cases pending in criminal court trial system The estimated regression equation (based on \(n=367\) observations) was $$ \hat{y}=28-.05 x_{1}-.003 x_{2}+.00002 x_{3} $$ where \(x_{3}=x_{1} x_{2}\) a. The reported value of \(R^{2}\) was . 16. Conduct the model utility test. Use a \(.05\) significance level. b. Given the results of the test in Part (a), does it surprise you that the \(R^{2}\) value is so low? Can you think of a possible explanation for this? c. How does adjusted \(R^{2}\) compare to \(R^{2}\) ?

This exercise requires the use of a computer package. The authors of the article "Absolute Versus per Unit Body Length Speed of Prey as an Estimator of Vulnerability to Predation" (Animal Behaviour [1999]: \(347-\) 352) found that the speed of a prey (twips/s) and the length of a prey (twips \(\times 100\) ) are good predictors of the time (s) required to catch the prey. (A twip is a measure of distance used by programmers.) Data were collected in an experiment where subjects were asked to "catch" an animal of prey moving across his or her computer screen by clicking on it with the mouse. The investigators varied the length of the prey and the speed with which the prey moved across the screen. The following data are consistent with summary values and a graph given in the article. Each value represents the average catch time over all subjects. The order of the various speed-length combinations was randomized for each subject. $$ \begin{array}{ccc} \begin{array}{c} \text { Prey } \\ \text { Length } \end{array} & \begin{array}{l} \text { Prey } \\ \text { Speed } \end{array} & \begin{array}{l} \text { Catch } \\ \text { Time } \end{array} \\ \hline 7 & 20 & 1.10 \\ 6 & 20 & 1.20 \\ 5 & 20 & 1.23 \\ 4 & 20 & 1.40 \\ 3 & 20 & 1.50 \\ 3 & 40 & 1.40 \\ 4 & 40 & 1.36 \\ 6 & 40 & 1.30 \\ 7 & 40 & 1.28 \\ 7 & 80 & 1.40 \\ 6 & 60 & 1.38 \\ 5 & 80 & 1.40 \\ 7 & 100 & 1.43 \\ 6 & 100 & 1.43 \\ 7 & 120 & 1.70 \\ 5 & 80 & 1.50 \\ 3 & 80 & 1.40 \\ 6 & 100 & 1.50 \\ 3 & 120 & 1.90 \\ & & \\ \hline \end{array} $$ a. Fit a multiple regression model for predicting catch time using prey length and speed as predictors. b. Predict the catch time for an animal of prey whose length is 6 and whose speed is 50 . c. Is the multiple regression model useful for predicting catch time? Test the relevant hypotheses using \(\alpha=.05\). d. The authors of the article suggest that a simple linear regression model with the single predictor \(x=\frac{\text { length }}{\text { speed }}\) might be a better model for predicting catch time. Calculate the \(x\) values and use them to fit this linear regression model. e. Which of the two models considered (the multiple regression model from Part (a) or the simple linear regression model from Part (d)) would you recommend for predicting catch time? Justify your choice.

This exercise requires the use of a computer package. The article "Movement and Habitat Use by Lake Whitefish During Spawning in a Boreal Lake: Integrating Acoustic Telemetry and Geographic Information Systems" (Transactions of the American Fisheries Society [1999]:\(939-952\) ) included the accompanying data on 17 fish caught in two consecutive years. $$ \begin{array}{ccccc} \text { Year } & \begin{array}{l} \text { Fish } \\ \text { Number } \end{array} & \begin{array}{l} \text { Weight } \\ (\mathrm{g}) \end{array} & \begin{array}{l} \text { Length } \\ (\mathrm{mm}) \end{array} & \begin{array}{l} \text { Age } \\ \text { (years) } \end{array} \\ \hline \text { Year 1 } & 1 & 776 & 410 & 9 \\ & 2 & 580 & 368 & 11 \\ & 3 & 539 & 357 & 15 \\ & 4 & 648 & 373 & 12 \\ & 5 & 538 & 361 & 9 \\ & 6 & 891 & 385 & 9 \\ & 7 & 673 & 380 & 10 \\ & 8 & 783 & 400 & 12 \\ \text { Year 2 } & 9 & 571 & 407 & 12 \\ & 10 & 627 & 410 & 13 \\ & 11 & 727 & 421 & 12 \\ & 12 & 867 & 446 & 19 \\ & 13 & 1042 & 478 & 19 \\ & 14 & 804 & 441 & 18 \\ & 15 & 832 & 454 & 12 \\ & 16 & 764 & 440 & 12 \\ & 17 & 727 & 427 & 12 \\ \hline \end{array} $$ a. Fit a multiple regression model to describe the relationship between weight and the predictors length and age. b. Carry out the model utility test to determine whether the predictors length and age, together, are useful for predicting weight.

The following statement appeared in the article "Dimensions of Adjustment Among College Women" (Journal of College Student Development \([1998]: 364):\) Regression analyses indicated that academic adjustment and race made independent contributions to academic achievement, as measured by current GPA. Suppose $$ \begin{aligned} y &=\text { current GPA } \\ x_{1} &=\text { academic adjustment score } \\ x_{2} &=\text { race (with white }=0 \text { , other }=1) \end{aligned} $$ What multiple regression model is suggested by the statement? Did you include an interaction term in the model? Why or why not?

A number of investigations have focused on the problem of assessing loads that can be manually handled in a safe manner. The article "Anthropometric, Muscle Strength, and Spinal Mobility Characteristics as Predictors in the Rating of Acceptable Loads in Parcel Sorting" (Ergonomics [1992]: \(1033-1044\) ) proposed using a regression model to relate the dependent variable \(y=\) individual's rating of acceptable load \((\mathrm{kg})\) to \(k=3\) independent (predictor) variables: \(x_{1}=\) extent of left lateral bending \((\mathrm{cm})\) $$ \begin{aligned} &x_{2}=\text { dynamic hand grip endurance (sec) } \\ &x_{3}=\text { trunk extension ratio }(\mathrm{N} / \mathrm{kg}) \end{aligned} $$ Suppose that the model equation is $$ y=30+.90 x_{1}+.08 x_{2}-4.50 x_{3}+e $$ and that \(\sigma=5\). a. What is the population regression function? b. What are the values of the population regression \(\underline{\mathrm{co}}\) efficients? c. Interpret the value of \(\beta_{1}\). d. Interpret the value of \(\beta_{3}\). e. What is the mean value of rating of acceptable load when extent of left lateral bending is \(25 \mathrm{~cm}\), dynamic hand grip endurance is \(200 \mathrm{sec}\), and trunk extension ratio is \(10 \mathrm{~N} / \mathrm{kg}\) ? f. If repeated observations on rating are made on different individuals, all of whom have the values of \(x_{1}, x_{2}\), and \(x_{3}\) specified in Part (e), in the long run approximately what percentage of ratings will be between \(13.5 \mathrm{~kg}\) and \(33.5 \mathrm{~kg} ?\)

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free