Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Recently sold, single-family homes. The National Association of Realtors maintains a database consisting of sales information on homes sold in the United States. The next table lists the sale prices for a sample of 28 recently sold, single-family homes. The table also identifies the region of the country in which the home is located and the total number of homes sold in the region during the month the home sold.

a) Propose a complete second-order model for the sale price of a single-family home as a function of region and sales volume.

b) Give the equation of the curve relating sale price to sales volume for homes sold in the West.

c) Repeat part b for homes sold in the Northwest.

d) Which b’s in the model, part a, allow for differences among the mean sale prices for homes in the four regions?

e) Fit the model, part a, to the data using an available statistical software package. Is the model statistically useful for predicting sale price? Test using α = .01.

Short Answer

Expert verified

a) A complete second-order model for the sale price of a single-family home as a function of region and sales volume can be written as

b) The equation of the curve relating sale price to sales volume for homes sold in the West can be written when X2= 0, X3 = 0 and X4= 0.The equation becomes.

c) The equation of the curve relating sale price to sales volume for homes sold in the NorthWest can be written when X2= 0, X3 = 1 and X4= 0.The equation becomes.

d) β2, β3, and β4 represent the differences among the mean sale prices for homes in the four regions. β2 represents difference in mean sale price of NE region and west region, β3 represents difference in mean sale price of NW region and west region, and β4represents difference in mean sale price of south region and west region.

e) It can be concluded with 99% confidence interval that the model is not statistically useful for predicting sale price.

Step by step solution

01

Second order model

A complete second-order model for the sale price of a single-family home as a function of region and sales volume can be written as

Where, x1= sales volume

X2= 1 if region is NE; 0 otherwise

X3= 1 if region is NW; 0 otherwise

X4= 1 if region is S; 0 otherwise

02

Subsequent order model equation

The equation of the curve relating sale price to sales volume for homes sold in the

West can be written when X2= 0, X3 = 0 and X4= 0

03

Next order model equation 

The equation of the curve relating sale price to sales volume for homes sold in the

Northwest can be written when X2= 0, X3 = 1 and X4= 0

04

Interpretation of β

β2, β3, and β4 represent the differences among the mean sale prices for homes in the four regions. Β2 represents difference in mean sale price of NE region and west region, β3 represents difference in mean sale price of NW region and west region, and β4represents difference in mean sale price of south region and west region.

05

Model fitted to “part a” equation

The excel output is presented below

SUMMARY OUTPUT

















Regression Statistics








Multiple R

0.921704








R Square

0.849538








Adjusted R Square

0.746095








Standard Error

24365.83








Observations

28

















ANOVA









df

SS

MS

F

Significance F




Regression

11

5.36E+10

4.88E+09

8.212628

0.000112




Residual

16

9.5E+09

5.94E+08






Total

27

6.31E+10













Coefficients

Standard Error

t Stat

P-value

Lower 95%

Upper 95%

Lower 95.0%

Upper 95.0%

Intercept

5568060

4015347

1.386695

0.184553

-2944095

14080214

-2944095

14080214

Sales Volume(x1)

-107.648

73.57124

-1.46319

0.162784

-263.613

48.31561

-263.613

48.31561

x2

-3663319

4478880

-0.81791

0.425422

-1.3E+07

5831482

-1.3E+07

5831482

x3

-3503659

4058018

-0.86339

0.40068

-1.2E+07

5098955

-1.2E+07

5098955

x4

1628588

5945303

0.273929

0.787644

-1.1E+07

14232068

-1.1E+07

14232068

x1^2

0.00054

0.000337

1.604055

0.128257

-0.00017

0.001254

-0.00017

0.001254

x1*x2

37.21225

103.005

0.361266

0.722626

-181.149

255.5732

-181.149

255.5732

x1*x3

59.45824

75.18596

0.790816

0.440617

-99.9289

218.8454

-99.9289

218.8454

x1*x4

13.35528

93.25644

0.14321

0.887912

-184.34

211.0501

-184.34

211.0501

x1^2*x2

0.000181

0.000733

0.246847

0.808166

-0.00137

0.001736

-0.00137

0.001736

x1^2*x3

-0.00024

0.000351

-0.68403

0.503745

-0.00098

0.000504

-0.00098

0.000504

x1^2*x4

-0.00022

0.000385

-0.58005

0.56996

-0.00104

0.000593

-0.00104

0.000593

To test the significance of the model F-test is conducted. The null and alternate hypothesis would be

Therefore, the model is not statistically useful for predicting sale price.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Question: Consider the model:

y=β0+β1x1+β2x2+β3x3+ε

where x1 is a quantitative variable and x2 and x3 are dummy variables describing a qualitative variable at three levels using the coding scheme

role="math" localid="1649846492724" x2=1iflevel20otherwisex3=1iflevel30otherwise

The resulting least squares prediction equation is y^=44.8+2.2x1+9.4x2+15.6x3

a. What is the response line (equation) for E(y) when x2 = x3 = 0? When x2 = 1 and x3 = 0? When x2 = 0 and x3 = 1?

b. What is the least squares prediction equation associated with level 1? Level 2? Level 3? Plot these on the same graph.

Question: Shared leadership in airplane crews. Refer to the Human Factors (March 2014) study of shared leadership by the cockpit and cabin crews of a commercial airplane, Exercise 8.14 (p. 466). Recall that simulated flights were taken by 84 six-person crews, where each crew consisted of a 2-person cockpit (captain and first officer) and a 4-person cabin team (three flight attendants and a purser.) During the simulation, smoke appeared in the cabin and the reactions of the crew were monitored for teamwork. One key variable in the study was the team goal attainment score, measured on a 0 to 60-point scale. Multiple regression analysis was used to model team goal attainment (y) as a function of the independent variables job experience of purser (x1), job experience of head flight attendant (x2), gender of purser (x3), gender of head flight attendant (x4), leadership score of purser (x5), and leadership score of head flight attendant (x6).

a. Write a complete, first-order model for E(y) as a function of the six independent variables.

b. Consider a test of whether the leadership score of either the purser or the head flight attendant (or both) is statistically useful for predicting team goal attainment. Give the null and alternative hypotheses as well as the reduced model for this test.

c. The two models were fit to the data for the n = 60 successful cabin crews with the following results: R2 = .02 for reduced model, R2 = .25 for complete model. On the basis of this information only, give your opinion regarding the null hypothesis for successful cabin crews.

d. The p-value of the subset F-test for comparing the two models for successful cabin crews was reported in the article as p 6 .05. Formally test the null hypothesis using α = .05. What do you conclude?

e. The two models were also fit to the data for the n = 24 unsuccessful cabin crews with the following results: R2 = .14 for reduced model, R2 = .15 for complete model. On the basis of this information only, give your opinion regarding the null hypothesis for unsuccessful cabin crews.

f. The p-value of the subset F-test for comparing the two models for unsuccessful cabin crews was reported in the article as p < .10. Formally test the null hypothesis using α = .05. What do you conclude?

When a multiple regression model is used for estimating the mean of the dependent variable and for predicting a new value of y, which will be narrower—the confidence interval for the mean or the prediction interval for the new y-value? Why?

Question: Failure times of silicon wafer microchips. Refer to the National Semiconductor study of manufactured silicon wafer integrated circuit chips, Exercise 12.63 (p. 749). Recall that the failure times of the microchips (in hours) was determined at different solder temperatures (degrees Celsius). The data are repeated in the table below.

  1. Fit the straight-line modelEy=β0+β1xto the data, where y = failure time and x = solder temperature.

  2. Compute the residual for a microchip manufactured at a temperature of 149°C.

  3. Plot the residuals against solder temperature (x). Do you detect a trend?

  4. In Exercise 12.63c, you determined that failure time (y) and solder temperature (x) were curvilinearly related. Does the residual plot, part c, support this conclusion?

Comparing private and public college tuition. According to the Chronicle of Higher Education Almanac, 4-year private colleges charge, on average, five times as much for tuition and fees than 4-year public colleges. In order to estimate the true difference in the mean amounts charged for an academic year, random samples of 40 private colleges and 40 public colleges were contacted and questioned about their tuition structures.

  1. Which of the procedures described in Chapter 8 could be used to estimate the difference in mean charges between private and public colleges?

  2. Propose a regression model involving the qualitative independent variable type of college that could be used to investigate the difference between the means. Be sure to specify the coding scheme for the dummy variable in the model.

  3. Explain how the regression model you developed in part b could be used to estimate the difference between the population means.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free