
Box plots and the standard normal distribution. What relationship exists between the standard normal distribution and the box-plot methodology (Section 2.8) for describing distributions of data using quartiles? The answer depends on the true underlying probability distribution of the data. Assume for the remainder of this exercise that the distribution is normal.

a. Calculate the values of the standard normal random variable $z$, call them $z_L$ and $z_U$, that correspond to the hinges of the box plot (that is, the lower and upper quartiles, $Q_L$ and $Q_U$) of the probability distribution.

b. Calculate the z-values that correspond to the inner fences of the box plot for a normal probability distribution.

c. Calculate the z-values that correspond to the outer fences of the box plot for a normal probability distribution.

d. What is the probability that an observation lies beyond the inner fences of a normal probability distribution? The outer fences?

e. Can you better understand why the inner and outer fences of a box plot are used to detect outliers in a distribution? Explain.

Short Answer

a. The z-values that correspond to the lower and upper quartiles are -0.67449 and 0.67449.

b. The z-values that correspond to the inner fences of the box plot are -2.697959 and 2.697959.

c. The z-values that correspond to the outer fences of the box plot are -4.72143 and 4.72143.

d. The probability of an observation falling outside the inner fences is 0.006977; outside the outer fences it is approximately 0.

e. For a normal distribution, the probability that an observation falls outside these fences is very low, so such observations are flagged as outliers.

Step by step solution

01

Given information

The underlying distribution of the data is assumed to be normal.

02

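Calculating the lower and upper quartiles

a.

The lower quartile is the 25th percentile.

Let $z_L$ be the value of the standard normal random variable that corresponds to $Q_L$, i.e.,

$$P(z < z_L) = 0.25 \;\Rightarrow\; \Phi(z_L) = 0.25 \;\Rightarrow\; z_L = \Phi^{-1}(0.25) = -0.67449$$

So the z-value of the lower quartile is -0.67449.

The upper quartile is the 75th percentile.

Let $z_U$ be the value of the standard normal random variable that corresponds to $Q_U$, i.e.,

$$P(z < z_U) = 0.75 \;\Rightarrow\; \Phi(z_U) = 0.75 \;\Rightarrow\; z_U = \Phi^{-1}(0.75) = 0.67449$$

So the z-value of the upper quartile is 0.67449.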
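The quartile lookup in part a can be checked numerically. The following is a minimal sketch, assuming Python 3.8 or later, where the standard library's `statistics.NormalDist` provides the inverse CDF; the variable names are illustrative and not part of the exercise.

```python
from statistics import NormalDist

# Standard normal distribution: mean 0, standard deviation 1
std_normal = NormalDist(mu=0.0, sigma=1.0)

# The quartiles are the 25th and 75th percentiles, i.e. the inverse CDF
# evaluated at 0.25 and 0.75
z_lower = std_normal.inv_cdf(0.25)
z_upper = std_normal.inv_cdf(0.75)

print(round(z_lower, 5), round(z_upper, 5))
```

By symmetry of the normal curve, the two values differ only in sign.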

03

 Calculating the inner fences of the box

b.

$$\text{IQR} = Q_U - Q_L = 0.67449 - (-0.67449) = 1.34898$$

The lower inner fence is

$$\text{LIF} = Q_L - 1.5 \times \text{IQR} = Q_L - 1.5(Q_U - Q_L) = -0.67449 - 1.5(1.34898) \approx -2.697959$$

The upper inner fence is

$$\text{UIF} = Q_U + 1.5 \times \text{IQR} = Q_U + 1.5(Q_U - Q_L) = 0.67449 + 1.5(1.34898) \approx 2.697959$$

So the lower inner fence is -2.697959 and the upper inner fence is 2.697959.
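The inner-fence arithmetic can be reproduced in a few lines. This is a sketch under the same assumption as the exercise (a standard normal distribution), using `statistics.NormalDist` from the Python standard library; the helper name `normal_fences` is hypothetical.

```python
from statistics import NormalDist

def normal_fences(k):
    """Return (lower, upper) box-plot fences in z units for fence multiplier k."""
    z = NormalDist()  # standard normal: mu=0, sigma=1
    q_l, q_u = z.inv_cdf(0.25), z.inv_cdf(0.75)
    iqr = q_u - q_l
    return q_l - k * iqr, q_u + k * iqr

# Inner fences use a multiplier of 1.5 on the interquartile range
lower_inner, upper_inner = normal_fences(1.5)
print(round(lower_inner, 6), round(upper_inner, 6))
```

Because $Q_L = -Q_U$ here, the IQR equals $2Q_U$ and the upper inner fence simplifies to $Q_U + 1.5(2Q_U) = 4Q_U$.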

04

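Calculating the outer fences of the box

c.

$$\text{IQR} = Q_U - Q_L = 0.67449 - (-0.67449) = 1.34898$$

The lower outer fence is $\text{LOF} = Q_L - 3 \times \text{IQR}$:

$$\text{LOF} = Q_L - 3 \times \text{IQR} = Q_L - 3(Q_U - Q_L) = -0.67449 - 3(1.34898) = -4.72143$$

The upper outer fence is $\text{UOF} = Q_U + 3 \times \text{IQR}$:

$$\text{UOF} = Q_U + 3 \times \text{IQR} = Q_U + 3(Q_U - Q_L) = 0.67449 + 3(1.34898) = 4.72143$$

So the lower outer fence is -4.72143 and the upper outer fence is 4.72143.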
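The z-values above apply to any normal distribution once observations are standardized with $z = (x - \mu)/\sigma$. The sketch below illustrates this for a normal distribution with hypothetical parameters ($\mu = 100$, $\sigma = 15$, chosen only for illustration and not taken from the exercise); the function name `box_plot_fences` is likewise an assumption.

```python
from statistics import NormalDist

def box_plot_fences(mu, sigma):
    """Inner (1.5 x IQR) and outer (3 x IQR) fences of a normal distribution, in data units."""
    dist = NormalDist(mu, sigma)
    q_l, q_u = dist.inv_cdf(0.25), dist.inv_cdf(0.75)
    iqr = q_u - q_l
    return {
        "inner": (q_l - 1.5 * iqr, q_u + 1.5 * iqr),
        "outer": (q_l - 3.0 * iqr, q_u + 3.0 * iqr),
    }

# Hypothetical parameters, for illustration only
mu, sigma = 100.0, 15.0
fences = box_plot_fences(mu, sigma)

# Standardizing the fences recovers the z-values found for the standard normal
z_inner = tuple((x - mu) / sigma for x in fences["inner"])
z_outer = tuple((x - mu) / sigma for x in fences["outer"])
print(z_inner, z_outer)
```

Whatever values of $\mu$ and $\sigma$ are used, the standardized outer fences come out at about $\pm 4.72$, which is why the exercise can work entirely in z units.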

05

 Calculating the probabilities

d.

The probability that an observation lies beyond the inner fences of a normal probability distribution is

$$P(z < -2.697959) + P(z > 2.697959) = [1 - P(z < 2.697959)] + [1 - P(z < 2.697959)] = 2 - 2\Phi(2.697959) = 2 - 2(0.996512) \approx 0.006977$$

(the last digit reflects the unrounded value of $\Phi$). So the probability is 0.006977.

The probability that an observation lies beyond the outer fences of a normal probability distribution is

$$P(z < -4.72143) + P(z > 4.72143) = [1 - P(z < 4.72143)] + [1 - P(z < 4.72143)] = 2 - 2\Phi(4.72143) = 2 - 2(0.999999) \approx 0$$

So the probability is approximately 0.
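Both tail probabilities can be evaluated directly from the normal CDF. A minimal sketch, assuming Python's standard-library `statistics.NormalDist`; the helper name `prob_beyond` is hypothetical. It uses the symmetry $P(z > a) = P(z < -a)$, so the two-sided probability is $2\Phi(-a)$, which avoids subtracting two numbers that are both very close to 1.

```python
from statistics import NormalDist

z = NormalDist()  # standard normal

def prob_beyond(fence):
    """Two-sided probability that a standard normal value lies beyond +/- fence."""
    # P(z < -fence) + P(z > fence) = 2 * P(z < -fence) by symmetry
    return 2 * z.cdf(-fence)

p_inner = prob_beyond(2.697959)  # beyond the inner fences
p_outer = prob_beyond(4.72143)   # beyond the outer fences

print(f"inner: {p_inner:.6f}")
print(f"outer: {p_outer:.2e}")
```

The outer-fence probability is not exactly zero; it is on the order of $10^{-6}$, which rounds to 0 at the precision used in the solution.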

06

Explanation

The inner and outer fences of a box plot are used to detect outliers in a distribution for the following reasons:

Values beyond the inner fences are deemed potential outliers because they are extreme values that occur relatively rarely. For a normal probability distribution, less than 1% of the observations (about 0.7%, from part d) are expected to fall outside the inner fences.

Measurements that fall beyond the outer fences are very extreme and require special analysis. Since less than one-hundredth of 1% (0.01%, or 0.0001) of the measurements from a normal distribution are expected to fall beyond the outer fences, these measurements are considered outliers.

Part (d) therefore shows why the inner and outer fences of the box plot are used to detect outliers in a distribution.
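To make these percentages concrete, the sketch below converts the tail probabilities into expected counts for a sample. The sample size `n = 1000` is a hypothetical value chosen for illustration, and `expected_flagged` is an assumed helper name; only `statistics.NormalDist` from the Python standard library is used.

```python
from statistics import NormalDist

def expected_flagged(n, fence_z):
    """Expected number of n normally distributed observations beyond +/- fence_z."""
    # Two-sided tail probability, using symmetry of the normal curve
    tail_prob = 2 * NormalDist().cdf(-fence_z)
    return n * tail_prob

n = 1000  # hypothetical sample size
beyond_inner = expected_flagged(n, 2.697959)
beyond_outer = expected_flagged(n, 4.72143)

print(f"expected beyond inner fences: {beyond_inner:.2f}")
print(f"expected beyond outer fences: {beyond_outer:.4f}")
```

In a normal sample of this size, roughly 7 observations are expected beyond the inner fences and essentially none beyond the outer fences, so a value outside the outer fences is strong evidence of something unusual.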

Most popular questions from this chapter

Detecting anthrax. Researchers at the University of South Florida Center for Biological Defense have developed a safe method for rapidly detecting anthrax spores in powders and on surfaces (USF Magazine, Summer 2002). The method has been found to work well even when there are very few anthrax spores in a powder specimen. Consider a powder specimen that has exactly 10 anthrax spores. Suppose that the number of anthrax spores in the sample detected by this method follows an approximate uniform distribution between 0 and 10.

a. Find the probability that 8 or fewer anthrax spores are detected in the powder specimen.

b. Find the probability that between 2 and 5 anthrax spores are detected in the powder specimen.

If a population data set is normally distributed, what is the proportion of measurements you would expect to fall within the following intervals?

a. $\mu \pm \sigma$

b. $\mu \pm 2\sigma$

c. $\mu \pm 3\sigma$

Mean shifts on a production line. Six Sigma is a comprehensive approach to quality goal setting that involves statistics. An article in Aircraft Engineering and Aerospace Technology (Vol. 76, No. 6, 2004) demonstrated the use of the normal distribution in Six Sigma goal setting at Motorola Corporation. Motorola discovered that the average defect rate for parts produced on an assembly line varies from run to run and is approximately normally distributed with a mean equal to 3 defects per million. Assume that the goal at Motorola is for the average defect rate to vary no more than 1.5 standard deviations above or below the mean of 3. How likely is it that the goal will be met?

Stock market participation and IQ. Refer to The Journal of Finance (December 2011) study of whether the decision to invest in the stock market is dependent on IQ, Exercise 3.46 (p. 182). Recall that an IQ score (from a low score of 1 to a high score of 9) was determined for each in a sample of 158,044 Finnish citizens. Also recorded was whether or not the citizen invested in the stock market. The accompanying table gives the number of Finnish citizens in each IQ score/investment category. Which group of Finnish citizens (market investors or noninvestors) has the highest average IQ score?

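| IQ Score | Invest in market | No investment | Totals |
|----------|------------------|---------------|--------|
| 1        | 893              | 4659          | 5552   |
| 2        | 1340             | 9409          | 10749  |
| 3        | 2009             | 9993          | 12002  |
| 4        | 5358             | 19682         | 25040  |
| 5        | 8484             | 24640         | 33124  |
| 6        | 10270            | 21673         | 31943  |
| 7        | 6698             | 11260         | 17958  |
| 8        | 5135             | 7010          | 12145  |
| 9        | 4464             | 5067          | 9531   |
| Totals   | 44651            | 113393        | 158044 |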

Stock market. Give an example of a continuous random variable that would be of interest to a stockbroker.
