Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

For each of the following data sets, construct a graphical display of the data distribution and then indicate what summary measures you would use to describe center and spread. a. The following are data on weekend exercise time for 20 females consistent with summary quantities given in the paper "An Ecological Momentary Assessment of the Physical Activity and Sedentary Behaviour Patterns of University Students" (Health Education Journal [2010]: \(116-125)\) Female-Weekend \(\begin{array}{lrrrrrr}84.0 & 27.0 & 82.5 & 0.0 & 5.0 & 13.0 & 44.5 \\ 3.0 & 0.0 & 14.5 & 45.5 & 39.5 & 6.5 & 34.5 \\ 0.0 & 14.5 & 40.5 & 44.5 & 54.0 & 0.0 & \end{array}\) b. The accompanying data are consistent with summary statistics that appeared in the paper "Shape of Glass and Amount of Alcohol Poured: Comparative Study of Effect of Practice and Concentration" (British Medical Journal [2005]: 1512-1514). Data represent the actual amount (in \(\mathrm{ml}\) ) poured into a tall, slender glass for individuals asked to pour a "shot" of alcohol \((44.3 \mathrm{ml}\) or 1.5 ounces). \(\begin{array}{lllllll}44.0 & 49.6 & 62.3 & 28.4 & 39.1 & 39.8 & 60.5 \\ 73.0 & 57.5 & 56.5 & 65.0 & 56.2 & 57.7 & 73.5 \\ 66.4 & 32.7 & 40.4 & 21.4 & & & \end{array}\) c. The accompanying data are from a graph that appeared in the paper "Ladies First? A Field Study of Discrimination in Coffee Shops" (Applied Economics [April, 2008]). The data are the wait times (in seconds) between orderingand receiving coffee for 19 female customers at a Boston coffee shop. \(\begin{array}{lrrrrrr}60 & 80 & 80 & 100 & 100 & 100 & 120 \\ 120 & 120 & 140 & 140 & 150 & 160 & 180 \\ 200 & 200 & 220 & 240 & 380 & & \end{array}\)

Short Answer

Expert verified
For Dataset A, use median as the measure of center and interquartile range for spread. For Dataset B, use mean for measure of center and standard deviation for spread. For Dataset C, use median as the measure of center and interquartile range for spread.

Step by step solution

01

Analyze Dataset A

For this dataset, to describe the center, compute the median of the data points as there are observable outliers i.e., multiple zero values which would impact the mean significantly and may not provide an accurate measure of center. Compute the interquartile range to measure the spread as it provides a better set of values considering the dataset's structure.
02

Analyze Dataset B

The dataset may have some outliers (e.g., 73.0 and 73.5), but overall the distribution seems fairly regular. Hence, the mean can be computed to represent the data's center. The standard deviation can be used to measure the spread as it provides an understanding of the variation in the amount of alcohol poured.
03

Analyze Dataset C

Looking at the data, it seems that there is a wide disparity in wait times, with some values significantly higher than others (e.g., 380). Therefore, it would be more appropriate to use the median as the measure of center and the interquartile range for spread as it mitigates the impact of extreme values.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Data on tipping percent for 20 restaurant tables, consistent with summary statistics given in the paper "Beauty and the Labor Market: Evidence from Restaurant Servers" (unpublished manuscript by Matt Parrett, 2007 ), are: \(\begin{array}{rrrrrrrr}0.0 & 5.0 & 45.0 & 32.8 & 13.9 & 10.4 & 55.2 & 50.0 \\\ 10.0 & 14.6 & 38.4 & 23.0 & 27.9 & 27.9 & 105.0 & 19.0\end{array}\) \(\begin{array}{cccc}10.0 & 32.1 & 11.1 & 15.0\end{array}\)

3.49 The Los Angeles Times (July \(17,\) 1995) reported that for a sample of 364 lawsuits in which punitive damages were awarded, the median damage award was \(\$ 50,000,\) and the mean was \(\$ 775,000 .\) What does this suggest about the distribution of values in the sample?

The data below are manufacturing defects per 100 cars for the 33 brands of cars sold in the United States (USA Today, June 16,2010 ). Many of these values are larger than 100 because one car might have many defects. \(\begin{array}{rrrrrrr}86 & 111 & 113 & 114 & 111 & 111 & 122 \\ 130 & 93 & 126 & 95 & 102 & 107 & 109 \\ 130 & 129 & 126 & 170 & 88 & 106 & 114 \\ 87 & 113 & 133 & 146 & 111 & 83 & 135 \\ 110 & 114 & 121 & 122 & 117 & & \end{array}\) Use these data to construct a boxplot. Write a few sentences describing the important characteristics of the boxplot.

The San Luis ObispoTelegram-Tribune(October1,1994) reported the following monthly salaries for supervisors from six different counties: \(\$ 5,354\) (Kern), \(\$ 5,166\) (Monterey), \(\$ 4,443\) (Santa Cruz), \(\$ 4,129\) (Santa Barbara), \(\$ 2,500\) (Placer), and \$2,220 (Merced). San Luis Obispo County supervisors are supposed to be paid the average of the two counties in the middle of this salary range. Which measure of center determines this salary, and what is its value? Find the value of the other measure of center featured in this chapter. Why is it not as favorable to the San Luis Obispo County supervisors (although it might appeal to taxpayers)?

The paper "Caffeinated Energy Drinks-A Growing Problem" (Drug and Alcohol Dependence [2009]: 1-10) reported caffeine per ounce for 8 top-selling energy drinks and for 11 highcaffeine energy drinks: Top Selling Energy Drinks $$ \begin{array}{llll} 9.6 & 10.0 & 10.0 & 9.0 \end{array} $$ 9.5 High-Caffeine Energy Drinks \(21.0 \quad 2\) 25.0 15.0 ! \(21 .\) \(\begin{array}{ll}5 & 35 .\end{array}\) 30.0 \(31.3 \quad 3\) \(33.3 \quad 11.9 \quad 16.3\) The mean caffeine per ounce is clearly higher for the highcaffeine energy drinks, but which of the two groups of energy drinks is the most variable? Justify your choice.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free