Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Independent random samples from normal populations produced the results shown in the next table.

Sample 1


Sample 2

1.23.11.72.83.0

4.22.73.63.9

a. Calculate the pooled estimate of σ2.

b. Do the data provide sufficient evidence to indicate that μ21? Test using α=.10.

c. Find a 90% confidence interval for (μ1μ2).

d. Which of the two inferential procedures, the test of hypothesis in part b or the confidence interval in part c, provides more information about (μ1μ2)?

Short Answer

Expert verified

The pooled variance is a rough approximation of the shared variance.

Step by step solution

01

Step-by-Step Solution Step 1: Definition of the pooled estimator.

The pooled estimatoris one that is derived by merging data from two or more separate samples from groups that are thought to have a similar mean. The pooled variance is a technique for estimating common variance.

The formula to find pooled estimator of the variance of two samples is:

sp2=(n11)s12+(n21)s22n1+n22

02

(a) Calculate a pooled estimate of variance.

Mean of sample 1 = x¯1=1.2+3.1+1.7+2.8+3.05=2.36

role="math" localid="1652802728137" s12=1n11i=1n[xix¯]2=151[(1.22.36)2+(3.12.36)2+(1.72.36)2+(2.82.36)2+(3.02.36)2]=14[1.3456+0.5476+0.4356+0.1936+0.4096]=14(2.932)=0.733

Mean of sample 2 = x¯2=4.2+2.7+3.6+3.94=3.6

s22=1n21i=1n[xix¯]2=141[(4.23.6)2+(2.73.6)2+(3.63.6)2+(3.93.6)2]=13[0.36+0.81+0+0.09]=13(1.26)=0.42

sp2=(51)0.733+(41)0.425+42=2.932+1.267=0.6

Therefore, the pooled estimate of variance is 0.6

03

(b) Conduct a t-test.

Null Hypothesis,H01μ2and

Alternate Hypothesis,Ha12

The level of significance is 0.10.

Degreeoffreedom=n1+n22=5+42=7

From the t-distribution table, the critical value at 0.10the level of the significance for degrees of freedom about the right-tailed test is -1.415.

t=x1¯x¯2sp21n1+1n2=2.363.60.615+14=1.240.6(0.2+0.25)=1.240.52=-2.38

As, the value of t<1.415, the null hypothesis should be rejected.

Therefore, the data provide sufficient evidence to indicate that μ2>μ1.

04

(c) Find confidence interval.

The 90% confidence interval for the difference in means

=(x¯1x¯2)±tα/2×s12n1+s22n2=(2.363.6)±1.895×0.7335+0.424=(1.24)±(1.895×0.502)=1.24±0.95

Therefore, the confidence interval for the difference of means is2.19to0.29

05

(d) State the conclusion.

The confidence interval tells us the specific limit within which the difference between the population means is expected to lie with 90% confidence, whereas the hypothesis testing presents the situation where we can tell that μ2>μ1without specifying any value of the difference between the population means.

Therefore, the confidence interval provides more information μ1μ2.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Given that x is a random variable for which a Poisson probability distribution provides a good approximation, use statistical software to find the following:

a.P(x2) when λ=1

b.P(x2) when λ=2

c.P(x2) when λ=3

d. What happens to the probability of the event {x2} as λ it increases from 1 to 3? Is this intuitively reasonable?

Question: Impact of race on football card values. Refer to the Electronic Journal of Sociology (2007) study of the Impact of race on the value of professional football players’ “rookie” cards, Exercise 12.72 (p. 756). Recall that the sample consisted of 148 rookie cards of NFL players who were inducted into the Football Hall of Fame (HOF). The researchers modelled the natural logarithm of card price (y) as a function of the following independent variables:

Race:x1=1ifblack,0ifwhiteCardavailability:x2=1ifhigh,0iflowCardvintage:x3=yearcardprintedFinalist:x4=naturallogarithmofnumberoftimesplayeronfinalHOFballotPosition-QB::x5=1ifquarterback,0ifnotPosition-RB:x7=1ifrunningback,0ifnotPosition-WR:x8=1ifwidereceiver,0ifnotPosition-TEx9=1iftightend,0ifnotPosition-DL:x10=1ifdefensivelineman,0ifnotPosition-LB:x11=1iflinebacker,0ifnotPosition-DB:x12=1ifdefensiveback,0ifnot

[Note: For position, offensive lineman is the base level.]

  1. The model E(y)=β0+β1x1+β2x2+β3x3+β4x4+β5x5+β6x6+β7x7+β8x8+β9x9+β10x10+β11x11+β12x12 was fit to the data with the following results:R2=0.705,Ra2=0.681,F=26.9.Interpret the results, practically. Make an inference about the overall adequacy of the model.
  2. Refer to part a. Statistics for the race variable were reported as follows:β^1=-0.147,sβ^1=-0.145,t=-1.014,p-value=0.312 .Use this information to make an inference about the impact of race on the value of professional football players’ rookie cards.
  3. Refer to part a. Statistics for the card vintage variable were reported as follows:β^3=-0.074,sβ^3=0.007,t=-10.92,p-value=.000.Use this information to make an inference about the impact of card vintage on the value of professional football players’ rookie cards.
  4. Write a first-order model for E(y) as a function of card vintage x3and position x5-x12that allows for the relationship between price and vintage to vary depending on position.

Question: Forecasting daily admission of a water park. To determine whether extra personnel are needed for the day, the owners of a water adventure park would like to find a model that would allow them to predict the day’s attendance each morning before opening based on the day of the week and weather conditions. The model is of the form

where,

y = Daily admission

x1 = 1 if weekend

0 otherwise

X2 = 1 if sunny

0 if overcast

X3 = predicted daily high temperature (°F)

These data were recorded for a random sample of 30 days, and a regression model was fitted to the data.

The least squares analysis produced the following results:

with

  1. Interpret the estimated model coefficients.
  2. Is there sufficient evidence to conclude that this model is useful for predicting daily attendance? Use α = .05.
  3. Is there sufficient evidence to conclude that the mean attendance increases on weekends? Use α = .10.
  4. Use the model to predict the attendance on a sunny weekday with a predicted high temperature of 95°F.
  5. Suppose the 90% prediction interval for part d is (645, 1,245). Interpret this interval.

Buy-side vs. sell-side analysts' earnings forecasts. Refer to the Financial Analysts Journal (Jul. /Aug. 2008) study of financial analysts' forecast earnings, Exercise 2.86 (p. 112). Recall that data were collected from 3,526 buy-side analysts and 58,562forecasts made by sell-side analysts, and the relative absolute forecast error was determined for each. The mean and standard deviation of forecast errors for both types of analysts are given in the table.

a. Construct a 95% confidence interval for the difference between the mean forecast error of buy-side analysts and the mean forecast error of sell-side analysts.

b. Based on the interval, part a, which type of analysis has the greater mean forecast error? Explain.

c. What assumptions about the underlying populations of forecast errors (if any) are necessary for the validity of the inference, part b?

Question: Promotion of supermarket vegetables. A supermarket chain is interested in exploring the relationship between the sales of its store-brand canned vegetables (y), the amount spent on promotion of the vegetables in local newspapers(x1) , and the amount of shelf space allocated to the brand (x2 ) . One of the chain’s supermarkets was randomly selected, and over a 20-week period, x1 and x2 were varied, as reported in the table.

Week

Sales, y

Advertising expenses,

Shelf space,

Interaction term,

1

2010

201

75

15075

2

1850

205

50

10250

3

2400

355

75

26625

4

1575

208

30

6240

5

3550

590

75

44250

6

2015

397

50

19850

7

3908

820

75

61500

8

1870

400

30

12000

9

4877

997

75

74775

10

2190

515

30

15450

11

5005

996

75

74700

12

2500

625

50

31250

13

3005

860

50

43000

14

3480

1012

50

50600

15

5500

1135

75

85125

16

1995

635

30

19050

17

2390

837

30

25110

18

4390

1200

50

60000

19

2785

990

30

29700

20

2989

1205

30

36150

  1. Fit the following model to the data:yβ0+β1x1+β2x2+β3x1x2+ε
  2. Conduct an F-test to investigate the overall usefulness of this model. Useα=.05 .
  3. Test for the presence of interaction between advertising expenditures and shelf space. Useα=.05 .
  4. Explain what it means to say that advertising expenditures and shelf space interact.
  5. Explain how you could be misled by using a first-order model instead of an interaction model to explain how advertising expenditures and shelf space influence sales.
  6. Based on the type of data collected, comment on the assumption of independent errors.
See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free