Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Blood cocaine concentration (mg/L) was determined both for a sample of individuals who had died from cocaine-induced excited delirium (ED) and for a sample of those who had died from a cocaine overdose without excited delirium; survival time for people in both groups was at most 6 hours. The accompanying data was read from a comparative boxplot in the article “Fatal Excited Delirium Following Cocaine Use” (J.

of Forensic Sciences, 1997: 25–31).

ED0 0 0 0 .1 .1 .1 .1 .2 .2 .3 .3

.3 .4 .5 .7 .8 1.0 1.5 2.7 2.8

3.5 4.0 8.9 9.2 11.7 21.0

Non-ED0 0 0 0 0 .1 .1 .1 .1 .2 .2 .2

.3 .3 .3 .4 .5 .5 .6 .8 .9 1.0

1.2 1.4 1.5 1.7 2.0 3.2 3.5 4.1

4.3 4.8 5.0 5.6 5.9 6.0 6.4 7.9

8.3 8.7 9.1 9.6 9.9 11.0 11.5

12.2 12.7 14.0 16.6 17.8

a. Determine the medians, fourths, and fourth spreads for the two samples.

b. Are there any outliers in either sample? Any extreme outliers?

c. Construct a comparative boxplot, and use it as a basis for comparing and contrasting the ED and non-ED samples.

Short Answer

Expert verified

a. For ED,

The median value is 0.4.

The lower fourth is 0.1.

The Upper fourth is 2.8.

The fourth spread is 2.7.

For non-ED,

The median value is 1.6.

The lower fourth is 0.3.

The Upper fourth is 7.9.

The fourth spread is 7.6.

b. For ED,

The outliers are 8.9, 9.2,11.7 and 21.

The extreme outliers are 11.7 and 21.

For non-ED,

There are no outliers.

c. The box plot is represented as,

Step by step solution

01

Given information

The data for the individuals who had diet from cocaine-induced excited delirium (ED) and those who had died from a cocaine overdose without excited delirium is provided.

The sample size for ED samples is 27.

The sample size for non-ED samples is 50.

02

Computing the median

Let x represents the ED samples.

Let y represent the non-ED samples.

a.

The data is arranged in ascending order.

For x,

For the odd number of observations, the median value is computed as,

\(\begin{aligned}\tilde x &= {\left( {\frac{{n + 1}}{2}} \right)^{th}}ordered\;value\\ &= {\left( {\frac{{27 + 1}}{2}} \right)^{th}}ordered\;value\\ &= {\left( {14} \right)^{th}}ordered\;value\end{aligned}\)

Thus, the median value of x is 0.4.

For y,

For the even number of observations, the median value is computed as,

\(\begin{aligned}\tilde y &= average\;of\;{\left( {\frac{n}{2}} \right)^{th}}and\;{\left( {\frac{n}{2} + 1} \right)^{th}}\;ordered\;values\\ &= average\;of\;{\left( {\frac{{50}}{2}} \right)^{th}}and\;{\left( {\frac{{50}}{2} + 1} \right)^{th}}\;ordered\;values\\ &= average\;of\;{\left( {25} \right)^{th}}and\;{\left( {26} \right)^{th}}\;ordered\;values\\ &= \frac{{1.5 + 1.7}}{2}\\ &= 1.6\end{aligned}\)

Thus, the median value is 1.6.

03

Computing the fourths and fourth spread

For ED,

\(\begin{aligned}{l}{n_{smallest}} &= 13\\{n_{{\rm{largest}}}} &= 13\end{aligned}\)

The lower fourth is computed as,

\(\begin{aligned}\frac{{{n_{smallest}} + 1}}{2} &= \frac{{13 + 1}}{2} \\ &= {7^{th}}value \\ \end{aligned} \)

The 7th value of the smallest half is 0.1.

This implies that the lower fourth is 0.1.

The upper fourth is computed as,

\(\begin{aligned}\frac{{{n_{{\rm{largest}}}} + 1}}{2} &= \frac{{13 + 1}}{2}\\ &= {7^{th}}value\end{aligned}\)

The 7th value of the largest half is 2.8.

This implies that the upper fourth is 2.8.

The fourth spread is computed as,

\(\begin{aligned}{f_s} &= {\rm{Upper}}\,{\rm{fourth}} - {\rm{Lower}}\;{\rm{fourth}}\\ &= 2.8 - 0.1\\ &= 2.7\end{aligned}\)

Therefore, the fourth spread is 2.7.

or non-ED,

\(\begin{aligned}{l}{n_{smallest}} &= 25\\{n_{{\rm{largest}}}} &= 25\end{aligned}\)

The lower fourth is computed as,

\(\begin{aligned}\frac{{{n_{smallest}} + 1}}{2} &= \frac{{25 + 1}}{2}\\ &= {13^{th}}value\end{aligned}\)

The 13th value of the smallest half is 0.3.

This implies that the lower fourth is 0.3.

The upper fourth is computed as,

\(\begin{aligned}\frac{{{n_{{\rm{largest}}}} + 1}}{2} &= \frac{{25 + 1}}{2}\\ &= {13^{th}}value\end{aligned}\)

The 13th value of the largest half is 7.9.

This implies that the upper fourth is 7.9.

The fourth spread is computed as,

\(\begin{aligned}{f_s} &= {\rm{Upper}}\,{\rm{fourth}} - {\rm{Lower}}\;{\rm{fourth}}\\ &= 7.9 - 0.3\\ &= 7.6\end{aligned}\)

Therefore, the fourth spread is 7.6.

04

Checking the outliers

b.

Referring to part a,

For ED,

The median value is 0.4.

The lower fourth is 0.1.

The Upper fourth is 2.8.

The fourth spread is 2.7.

For non-ED,

The median value is 1.6.

The lower fourth is 0.3.

The Upper fourth is 7.9.

The fourth spread is 7.6.

Any observation farther than 1.5\({f_s}\) from the closest fourth is an outlier.

An outlier is extreme if it is more than 3\({f_s}\) from the nearest fourth, otherwise, it is mild.

For x,

The calculations are as follows,

\(\begin{aligned}Lower\;fourth - 1.5{f_s} &= 0.1 - \left( {1.5*2.7} \right)\\ &= 0.1 - 4.05\\ &= - 3.95\end{aligned}\)

\(\begin{aligned}Upper\;fourth + 1.5{f_s} &= 2.8 + \left( {1.5*2.7} \right)\\ &= 2.8 + 4.05\\ &= 6.85\end{aligned}\)

It can be observed that there are no values that are less than -3.95.but there are few values that exceeds 6.85; they are, 8.9, 9.2,11.7 and 21.

For extreme outlier,

\(\begin{aligned}Upper\;fourth + 3{f_s} &= 2.8 + \left( {3*2.7} \right)\\ &= 2.8 + 8.1\\ &= 10.9\end{aligned}\)

Therefore, the extreme outliers are 11.7 and 21.

For y,

The calculations are as follows,

\(\begin{aligned}Lower\;fourth - 1.5{f_s} &= 0.3 - \left( {1.5*7.6} \right)\\ < 0\end{aligned}\)

\(\begin{aligned}Upper\;fourth + 1.5{f_s} &= 7.9 + \left( {1.5*7.6} \right)\\ &= 7.9 + 11.4\\ &= 19.3\end{aligned}\)

It can be observed that there are no values that are negative and no values are greater than 19.3.

This implies that are no outliers.

05

Construction of a comparative boxplot

Referring to parts a and b,

The five-number summary is,

For ED,

The smallest observation is 0.

The median value is 0.4.

The lower fourth is 0.1.

The Upper fourth is 2.8.

The fourth spread is 2.7.

The largest observation is 21.

For non-ED,

The smallest observation is 0.

The median value is 1.6.

The lower fourth is 0.3.

The Upper fourth is 7.9.

The fourth spread is 7.6.

The largest observation is 17.8.

The steps to construct a boxplot are as follows,

1. Compute the values of the five-number summary (smallest value, lower fourth, median, upper fourth, largest value).

2. Construct a line segment from the smallest value to the largest value of the dataset.

3. Construct a rectangular box from the lower fourth to the upper fourth and draw a line in the box at the median value.

The boxplot is represented as,

The red-colored boxplot represents ED, and the green-colored boxplot represents non-ED.

The comparison is as follows,

1. There are four outliers for ED and no outliers for non-ED samples.

2. The distribution is positively skewed for both samples.

3. There is less variability in ED than in the non-ED sample.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Consider the following information on ultimate tensile strength (lb/in) for a sample of n = 4 hard zirconium copper wire specimens (from “Characterization Methods forFine Copper Wire,” Wire J. Intl., Aug., 1997: 74–80):mean = 76,831 s = 180 smallest = 76,683 largest = 77,048

Determine the values of the two middle sample observations (and don’t do it by successive guessing!)

The weekly demand for propane gas (in \({\rm{1000s}}\) of gallons) from a particular facility is an \({\rm{rv}}\) \({\rm{X}}\) with pdf

\({\rm{f(x) = }}\left\{ {\begin{array}{*{20}{c}}{{\rm{2}}\left( {{\rm{1 - }}\frac{{\rm{1}}}{{{{\rm{x}}^{\rm{2}}}}}} \right)}&{{\rm{1}} \le {\rm{x}} \le {\rm{2}}}\\{\rm{0}}&{{\rm{ otherwise }}}\end{array}} \right.\)

a. Compute the cdf of \({\rm{X}}\).

b. Obtain an expression for the \({\rm{(100p)th}}\) percentile. What is the value of \({\rm{\tilde \mu }}\)?

c. Compute \({\rm{E(X)}}\) and \({\rm{V(X)}}\).

d. If \({\rm{1}}{\rm{.5}}\) thousand gallons are in stock at the beginning of the week and no new supply is due in during the week, how much of the \({\rm{1}}{\rm{.5}}\) thousand gallons is expected to be left at the end of the week? (Hint: Let \({\rm{h(x) = }}\) amount left when demand \({\rm{ = x}}\).)

The propagation of fatigue cracks in various aircraft parts has been the subject of extensive study in recent years. The accompanying data consists of propagation lives(flight hours/\({\bf{1}}{{\bf{0}}^{\bf{4}}}\)) to reach a given crack size in fastener holes intended for use in military aircraft (“Statistical Crack Propagation in Fastener Holes Under Spectrum Loading,” J. Aircraft,1983: 1028–1032):

.736 .863 .865 .913 .915 .937 .983 1.007

1.011 1.064 1.109 1.132 1.140 1.153 1.253 1.394

  1. Compute and compare the values of the sample mean and median.
  2. By how much could the largest sample observation be decreased without affecting the value of the median?

The May 1, 2009, issue of the Mont clarian reported the following home sale amounts for a sample of homes in Alameda, CA that were sold the previous month (1000s of $):

590 815 575 608 350 1285 408 540 555 679

  1. Calculate and interpret the sample mean and median.
  2. Suppose the 6th observation had been 985 rather than 1285. How would the mean and median change?
  3. Calculate a 20% trimmed mean by first trimming the two smallest and two largest observations.
  4. Calculate a 15% trimmed mean.

The article “Reliability-Based Service-Life Assessment of Aging Concrete Structures” (J. Structural Engr.,\({\rm{1993: 1600 - 1621}}\)) suggests that a Poisson process can be used to represent the occurrence of structural loads over time. Suppose the mean time between occurrences of loads is\({\rm{.5}}\)year. a. How many loads can be expected to occur during a\({\rm{2}}\)-year period? b. What is the probability that more than five loads occur during a\({\rm{2}}\)-year period? c. How long must a time period be so that the probability of no loads occurring during that period is at most\({\rm{.1}}\)?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free