Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Sanitation inspection of cruise ships. Refer to Exercise 2.81 (p. 111) and the data on the sanitation levels of passenger cruise ships.

  1. Use the box plot method to detect any outliers in the data set.
  2. Use the z-score method to detect any outliers in the data set.
  3. Do the two methods agree? If not, explain why

Short Answer

Expert verified
  1. 105.5

  2. This approach considers any point with a z-score more than 2 or less than -2 an outlier.

  3. As a result, no outliers are discovered using the z-score approach.

Step by step solution

01

(a) Box plot method

We find the 5-point summary, as given below, and afterward calculate the IQR as well as outlier boundaries.

The five-point summary:

min=691stquartile=93median=963rdquartile=98max=100

skewness=3mean-medianσ=394.4409-965.3352=0.8766869

IQR=Q3-Q1

=98-93

=5

To find the outlier:

lowerlimit=Q1-1.5IQR=93-(1.5×5)=85.5Higherlimit=Q1+1.5IQR=93+(1.5×5)=105.5

Thus, any worth less than 85.5 or more than 105.5 is an outlier.

As we can see, some points are less than 85.5 and thus are considered outliers.

02

(b) z-score method

As illustrated, we compute the z-score for every observation.

This approach considers any point with a z-score more than 2 or less than -2 an outlier.

However, no outlier is discovered using this approach because no point has a z-score more than or even less than 2.

03

(c) Two methods

In the box plot approach, they only examine the centre 50% of the information to calculate the IQR, which is used as a measurement of the information's dispersion. When we plot the data as a histogram, we see that it is skewed to the left. As a result, employing the median as a central tendency measurement is more suitable in this circumstance.

One utilized the standard deviation and mean from the complete dataset in the z-score approach. Both are heavily influenced by outlier estimates and thus tend to distort the standard deviation and mean readings. As a result, no outliers are discovered using the z-score approach.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Calculate the range, variance, and standard deviation for the following samples:

a.4, 2, 1, 0, 1

b.1, 6, 2, 2, 3, 0, 3

c.8, -2, 1, 3, 5, 4, 4, 1, 3, 3

d.0, 2, 0, 0, -1, 1, -2, 1, 0, -1, 1, -1, 0, -3,-2, -1, 0, 1

Question: The output from a statistical software package indicates that the mean and standard deviation of a data set consisting of 200 measurements are \(1,500 and \)300, respectively.

a.What are the units of measurement of the variable of interest? Based on the units, what type of data is this: quantitative or qualitative?

b.What can be said about the number of measurements between \(900 and \)2,100? Between \(600 and \)2,400? Between \(1,200 and \)1,800? Between \(1,500 and \)2,100?

Performance of stock screeners.Investment companies provide their clients with automated tools—called stock screeners—to help them select a portfolio of stocks to invest in. The American Association of Individual Investors (AAII) provides statistics on stock screeners at its Website, www.aaii.com. The next table lists the annualized percentage return on investment (as compared to the Standard & Poor’s 500 Index) for 13 randomly selected stock screeners. (Note:A negative annualized return reflects a stock portfolio that performed worse than the S&P 500.)

(9.0, -.1, -1.6, 14.6, 16.0, 7.7, 19.9, 9.8, 3.2, 24.8, 17.6, 10.7, 9.1)

a.Compute the mean for the data set. Interpret its value.

b.Compute the median for the data set. Interpret its value.

Improving SAT scores.The National Education Longitudinal Survey (NELS) tracks a nationally representative sample of U.S. students from eighth grade through high school and college. Research published in Chance (Winter 2001) examined the Standardized Assessment Test (SAT) scores of 265 NELS students who paid a private tutor to help them improve their scores. The table summarizes the changes in both the SAT–Mathematics and SAT–Verbal scores for these students.

SAT–Math

SAT–Verbal

Mean change in score

19

7

Standard deviation of score changes

65

49

a.Suppose one of the 265 students who paid a private tutor is selected at random. Give an interval that is likely to contain this student’s change in the SAT–Math score.

b.Repeat part afor the SAT–Verbal score.

c.Suppose the selected student increased his score on one of the SAT tests by 140 points. Which test, the SAT– Math or SAT–Verbal, is the one most likely to have the 140-point increase? Explain.

Calculate the range, variance, and standard deviation for the following samples:

a.39, 42, 40, 37, 41

b.100, 4, 7, 96, 80, 3, 1, 10, 2

c.100, 4, 7, 30, 80, 30, 42, 2

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free