Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Using the Kruskal-Wallis Test. In Exercises 5–8, use the Kruskal-Wallis test.

Correcting the H Test Statistic for Ties In using the Kruskal-Wallis test, there is a correction factor that should be applied whenever there are many ties: Divide H by

\(1 - \frac{{\sum T }}{{{N^3} - N}}\)

First combine all of the sample data into one list, and then, in that combined list, identify the different groups of sample values that are tied. For each individual group of tied observations, identify the number of sample values that are tied and designate that number as t, then calculate\(T = {t^3} - t\). Next, add the T values to get\(\sum T \). The value of N is the total number of observations in all samples combined. Use this procedure to find the corrected value of H for Example 1 in this section on page 628. Does the corrected value of H differ substantially from the value found in Example 1?

Short Answer

Expert verified

The corrected value of H is equal to 0.703.

No, the corrected value of H does not differ significantly from the ordinary value.

Step by step solution

01

Given information

Three samples showing the performance IQ scores of subjects with different blood lead levels are given.

Refer to Example 1 of Section 13-5.The following table shows the ranks:

IQ score

Sample Name

Ranks

85

Low Lead Level

6.5

90

Low Lead Level

8.5

107

Low Lead Level

18.5

85

Low Lead Level

6.5

100

Low Lead Level

15.5

97

Low Lead Level

12.5

101

Low Lead Level

17

64

Low Lead Level

1

78

Medium Lead Level

2

97

Medium Lead Level

12.5

107

Medium Lead Level

18.5

80

Medium Lead Level

4

90

Medium Lead Level

8.5

83

Medium Lead Level

5

93

High Lead Level

10

100

High Lead Level

15.5

97

High Lead Level

12.5

79

High Lead Level

3

97

High Lead Level

12.5

02

Obtain the correction factor

Under the Kruskal-Wallis test, the value of the test statistic (H) is computed without accounting for the number of ties.

Ties represent sample values that occur multiple times in three samples.

If there are ties present, a correction is applied to the value of H.

The correction factor has the following formula:

\(1 - \frac{{\sum T }}{{{N^3} - N}}\)

where

\(T = {t^3} - t\)

Here, t is the number of sample values tied to each tied group.

Referring to Example 1 of Section 13-5, the given table shows the sample values that are tied, the corresponding value of t, and the corresponding value of T.

Sample values that are tied

t

\(T = {t^3} - t\)

85

2

6

90

2

6

107

2

6

97

4

60

100

2

6



Sum=84

Here, the sample size of the three groups is given as:

\(\begin{array}{c}{n_1} = 8\\{n_2} = 6\\{n_3} = 5\\\end{array}\)

The total number of observations is given as:

\(\begin{array}{c}N = 8 + 6 + 5\\ = 19\end{array}\)

The correction factor is computed as shown:

\(\begin{array}{c}1 - \frac{{\sum T }}{{{N^3} - N}} = 1 - \frac{{84}}{{6859 - 19}}\\ = 0.987719\end{array}\)

Thus, the correction factor is equal to 0.987719.

03

Obtain the sum of the ranks

The sum of the ranks corresponding to the low blood lead level is computed as follows:

\(\begin{array}{c}{R_1} = 6.5 + 8.5 + 18.5 + .... + 1\\ = 86\end{array}\)

The sum of the ranks corresponding tothe medium blood lead levelis computed as follows:

\(\begin{array}{c}{R_2} = 2 + 12.5 + 18.5 + .... + 5\\ = 50.5\end{array}\)

The sum of the ranks corresponding to the high blood lead level is computed as follows:

\(\begin{array}{c}{R_3} = 10 + 15.5 + 12.5 + .... + 12.5\\ = 53.5\end{array}\)

04

Calculate the test statistic

The value of the test statistic is computed as shown below:

\)(\begin{array}{c}H = \frac{{12}}{{N\left( {N + 1} \right)}}\left( {\frac{{{R_1}^2}}{{{n_1}}} + \frac{{{R_2}^2}}{{{n_2}}} + \frac{{{R_3}^2}}{{{n_3}}}} \right) - 3\left( {N + 1} \right)\\ = \frac{{12}}{{19\left( {20} \right)}}\left( {\frac{{{{86}^2}}}{8} + \frac{{{{50.5}^2}}}{6} + \frac{{{{53.5}^2}}}{5}} \right) - 3\left( {20} \right)\\ = 0.694\end{array}\)

The ordinary value of H is equal to 0.694.

05

Determine the value of the corrected H

The corrected H can be computed as:

\(\begin{array}{c}{\rm{Corrected}}\;H = \frac{H}{{{\rm{Correction}}\;{\rm{factor}}}}\\ = \frac{{0.694}}{{0.987719}}\\ = 0.703\end{array}\)

Thus, the corrected value of H is equal to 0.703.

06

Compare the H value and the corrected H value

The corrected value of H(0.703) does not differ significantly from the H value (0.694). Thus, it can be said that a large number of ties does not seem to have a considerable amount of effect on the H value.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

What Are We Testing? Refer to the sample data in Exercise 1. Assuming that we use the Wilcoxon rank-sum test with those data, identify the null hypothesis and all possible alternative hypotheses.

Body Temperatures For the matched pairs listed in Exercise 1, identify the following components used in the Wilcoxon signed-ranks test:

a. Differences d

b. The ranks corresponding to the nonzero values of | d |

c. The signed-ranks

d. The sum of the positive ranks and the sum of the absolute values of the negative ranks

e. The value of the test statistic T

f. The critical value of T (assuming a 0.05 significance level in a test of no difference between body temperatures at 8 AM and 12 AM)

Sign Test and Wilcoxon Signed-Ranks Test What is a major advantage of the Wilcoxon signed-ranks test over the sign test when analyzing data consisting of matched pairs?

Measuring Seals from Photos Listed below are the overhead widths (in cm) of seals measured from photographs and the weights of the seals (in kg). The data are based on “Mass Estimation of Weddell Seals Using Techniques of Photogrammetry,” by R. Garrott of Montana State University. The purpose of the study was to determine if weights of seals could be determined from overhead photographs. Is there sufficient evidence to conclude that there is a correlation between overhead widths and the weights of the seals?

Overhead width (cm)

7.2

7.4

9.8

9.4

8.8

8.4

Weight (kg)

116

154

245

202

200

191

Runs Test with Large Samples. In Exercises 9–12, use the runs test with a significance level of\(\alpha \)= 0.05. (All data are listed in order by row.)

Stock Market: Testing for Randomness Above and Below the Median Listed below are the annual high values of the Dow Jones Industrial Average for a recent sequence of years. Find the median, then test for randomness below and above the median. What does the result suggest about the stock market as an investment consideration?

969

995

943

985

969

842

951

1036

1052

892

882

1015

1000

908

898

1000

1024

1071

1287

1287

1553

1956

2722

2184

2791

3000

3169

3413

3794

3978

5216

6561

8259

9374

11568

11401

11350

10635

10454

10855

10941

12464

14198

13279

10580

11625

12929

13589

16577

18054











See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free