Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Best-paid CEOs.Refer to Glassdoor Economic Research firm’s 2015 ranking of the 40 best-paid CEOs in Table 2.1 (p. 65). Recall that data were collected on a CEO’s current salary, age, and the ratio of salary to a typical worker’s pay at the firm.

a.Create a scatterplot to relate a CEO’s ratio of salary to worker pay to the CEO’s age. Comment on the strength of the association between these two variables.

b.Conduct an outlier analysis of the ratio variable. Identify the highly suspect outlier in the data.

c.Remove the highly suspect outlier from the data and recreate the scatterplot of part a. What do you observe?

Short Answer

Expert verified

a. The graph is given below:


No strong association

b.Highly suspect outlier = 1951

c. No major change

Step by step solution

01

Creating a Scatterplot

The graph is given below:

There is no visible strong association between the CEO’s age and the ratio of the CEO’s salary to that of a typical worker. The data is clustered to the left of the graph except for a few scattered points.

02

Conducting an outlier analysis and identifying the highly suspect outliers

We will use the box plot method to conduct an outlier analysis on the ratio variable.

We will first calculate the quartiles, IQR, Inner, and Outer fences.

Arranging the data in ascending order,

Q1=N+14=40+14=414=10.25th

Term=483

Q2=N+12=40+12=412=20.25th

Term=536.5

Q3=3N+14=340+14=1234=30.75th

Term=644.25

IQR=QU–QL=644.25–483=161.25

LowerInnerFence=QL1.5IQR=4831.5161.25=483241.875=241.125

UpperInnerFence=QU+1.5IQR=644.25+1.5161.25=644.25+241.875=886.125

Now we will plot these,

The graph shows thatthe outliers are 1951, 1522, 1192, 1133, and 939.

To find which of these are highly suspect outliers (with a z-score > 3), we will check the z-score of these outliers.

First, we will calculate the mean and standard deviation of the data.

Mean=SumofallobservationsNo.ofobservations=2567040=641.75

Variance=χ2n1=386417039=99081.28

StandardDeviation=Variance=99081.28=314.77

Mean = 641.75 and Standard Deviation = 314.77

z-scoreof1951=1951641.75314.77=1309.25314.77=4.159

z-scoreof1522=1522641.75314.77=880.25314.77=2.79

Based on the values of the z-score, 1522 is not a highly suspect outlier. Therefore, none of the values lower than 1522 are outliers.

So, the only highly suspect outlier is 1951.

03

Creating a scatterplot without the highly suspect outlier.

1951 is a highly suspect outlier. Therefore, we will remove that from the data and create the scatterplot.

There is not much difference in the scatterplot after removing the highly suspect outlier.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Question: State SAT scores.Refer to Exercise 2.27 (p. 84) and the data on state SAT scores. Construct a scatterplot for the data, with the 2010 Math SAT score on the horizontal axis and the 2014 Math SAT score on the vertical axis. What type of trend do you detect?

State

2010 Math SAT

2014 Math SAT

Alabama

538

550

Alaska

503

513

Arizona

527

524

Arkansa

571

564

California

510

516

Wisconsin

608

603

Wyoming

599

565

Salary offers to MBAs.Consider the top salary offer (in thousands of dollars) received by each member of a sample of 50 MBA students who graduated from the Graduate School of Management at Rutgers, the state university of New Jersey. Descriptive statistics and a box plot for the data are shown on the XLSTAT printouts at the top of the next column. [Note:The “+” on the box plot represents the location of the mean.]

a.Find and interpret the z-score associated with the highest salary offer, the lowest salary offer, and the mean salary offer. Would you consider the highest offer to be unusually high? Why or why not?

b.Based on the box plot for this data set, which salary offers (if any) are suspect or highly suspect outliers?

Using only integers between 0 and 10, construct two data sets with at least 10 observations each that have the same range but different means. Construct a dot plot for each of your data sets, and mark the mean of each data set on its dot diagram.

Salaries of bachelor’s degree graduates. PayScale, Inc., an online provider of global compensation data, conducts an annual salary survey of bachelor’s degree graduates. Three of the many variables measured by PayScale are the graduate’s current salary, mid-career salary, and the college or university where they obtained their degree. Descriptive statistics are provided for each of the over 400 colleges and universities that graduates attended. For example, graduates of the University of South Florida (USF) had a mean current salary of \(57,000, a median mid-career salary of \)48,000, and a mid-career 90th percentile salary of $131,000. Describe the salary distribution of USF bachelor’s degree graduates by interpreting each of these summary statistics.

Microsoft program security issues. To help its users combat malicious attacks (e.g., worms, viruses) on its computer software, Microsoft periodically issues a security bulletin that reports the software affected by the vulnerability. In Computers & Security (July 2013), researchers focused on reported security issues with three Microsoft products: Office, Windows, and Explorer

a. In a sample of 50 security bulletins issued in a recent year, 32 reported a security issue with Windows, 6 with Explorer, and 12 with Office. Construct a pie chart to describe the Microsoft products with security issues. Which product had the lowest proportion of security issues?

b. The researchers also categorized the security bulletins according to the expected repercussion of the vulnerability. Categories were Denial of service, Information disclosure, Remote code execution, Spoofing, and Privilege elevation. Suppose that of the 50 bulletins sampled, the following numbers of bulletins were classified into each respective category: 6, 8, 22, 3, 11. Construct a Pareto diagram to describe the expected repercussions from security issues. Based on the graph, what repercussion would you advise Microsoft to focus on?

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free