Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Benford’s Law. According to Benford’s law, a variety of different data sets include numbers with leading (first) digits that follow the distribution shown in the table below. In Exercises 21–24, test for goodness-of-fit with the distribution described by Benford’s law.

Leading Digits

Benford's Law: Distributuon of leading digits

1

30.10%

2

17.60%

3

12.50%

4

9.70%

5

7.90%

6

6.70%

7

5.80%

8

5.10%

9

4.60%

Tax Cheating? Frequencies of leading digits from IRS tax files are 152, 89, 63, 48, 39, 40, 28, 25, and 27 (corresponding to the leading digits of 1, 2, 3, 4, 5, 6, 7, 8, and 9, respectively, based on data from Mark Nigrini, who provides software for Benford data analysis). Using a 0.05 significance level, test for goodness-of-fit with Benford’s law. Does it appear that the tax entries are legitimate?

Short Answer

Expert verified

There is not enough evidence to conclude thatthe observed frequencies of the leading digits are not the same as the frequencies expected from Benford’s law.

Yes, it appears that the tax entries are legitimate.

Step by step solution

01

Given information

The frequencies of the different leading digits from IRS tax files are recorded.

02

Check the requirements

Assume that random sampling is conducted.

Let O denote the observed frequencies of the leading digits.

The observed frequencies are noted below:

\(\begin{aligned}{c}{O_1} = 152\\{O_2} = 89\\{O_3} = 63\;\;\\{O_4} = 48\end{aligned}\)

\({O_5} = 39\)

\(\begin{aligned}{c}{O_6} = 40\\{O_7} = 28\;\;\\{O_8} = 25\;\;\\{O_9} = 27\end{aligned}\)

The sum of all observed frequencies is computed below:

\(\begin{aligned}{c}n = 152 + 89 + ... + 27\\ = 511\end{aligned}\)

Let E denote the expected frequencies.

Let the expected proportion and expected frequencies of the ith digit as given by Benford’s law.

Leading Digits

Benford's Law: Distributuon of leading digits

Proportions

Expected Frequencies

1

30.10%

\(\begin{aligned}{c}{p_1} = \frac{{30.1}}{{100}}\\ = 0.301\end{aligned}\)

\[\begin{aligned}{c}{E_1} = n{p_1}\\ = 511\left( {0.301} \right)\\ = 153.811\end{aligned}\]

2

17.60%

\(\begin{aligned}{c}{p_2} = \frac{{17.6}}{{100}}\\ = 0.176\end{aligned}\)

\[\begin{aligned}{c}{E_2} = n{p_2}\\ = 511\left( {0.176} \right)\\ = 89.936\end{aligned}\]

3

12.50%

\(\begin{aligned}{c}{p_3} = \frac{{12.5}}{{100}}\\ = 0.125\end{aligned}\)

\[\begin{aligned}{c}{E_3} = n{p_3}\\ = 511\left( {0.125} \right)\\ = 63.875\end{aligned}\]

4

9.70%

\[\begin{aligned}{c}{p_4} = \frac{{9.7}}{{100}}\\ = 0.097\end{aligned}\]

\[\begin{aligned}{c}{E_4} = n{p_4}\\ = 511\left( {0.097} \right)\\ = 49.567\end{aligned}\]

5

7.90%

\[\begin{aligned}{c}{p_5} = \frac{{7.9}}{{100}}\\ = 0.079\end{aligned}\]

\[\begin{aligned}{c}{E_5} = n{p_5}\\ = 511\left( {0.079} \right)\\ = 40.369\end{aligned}\]

6

6.70%

\[\begin{aligned}{c}{p_6} = \frac{{6.7}}{{100}}\\ = 0.067\end{aligned}\]

\[\begin{aligned}{c}{E_6} = n{p_6}\\ = 511\left( {0.067} \right)\\ = 34.237\end{aligned}\]

7

5.80%

\(\begin{aligned}{c}{p_7} = \frac{{5.8}}{{100}}\\ = 0.058\end{aligned}\)

\[\begin{aligned}{c}{E_7} = n{p_7}\\ = 511\left( {0.058} \right)\\ = 29.638\end{aligned}\]

8

5.10%

\(\begin{aligned}{c}{p_8} = \frac{{5.1}}{{100}}\\ = 0.051\end{aligned}\)

\[\begin{aligned}{c}{E_8} = n{p_8}\\ = 511\left( {0.051} \right)\\ = 26.061\end{aligned}\]

9

4.60%

\(\begin{aligned}{c}{p_9} = \frac{{4.6}}{{100}}\\ = 0.046\end{aligned}\)

\[\begin{aligned}{c}{E_9} = n{p_9}\\ = 511\left( {0.046} \right)\\ = 23.506\end{aligned}\]

As all the expected values are higher than 5, the requirements of the test are satisfied.

03

State the hypotheses

The null hypothesis for conducting the given test is as follows:

The observed frequencies of leading digits are the same as the frequencies expected from Benford’s law.

The alternative hypothesis is as follows:

The observed frequencies of leading digits are not the same as the frequencies expected from Benford’s law.

04

Conduct the hypothesis test

The table below shows the necessary calculations:

Leading Digits

O

E

\(\left( {O - E} \right)\)

\(\frac{{{{\left( {O - E} \right)}^2}}}{E}\)

1

152

153.811

-1.811

0.021323

2

89

89.936

-0.936

0.009741

3

63

63.875

-0.875

0.011986

4

48

49.567

-1.567

0.049539

5

39

40.369

-1.369

0.046426

6

40

34.237

5.763

0.970067

7

28

29.638

-1.638

0.090527

8

25

26.061

-1.061

0.043196

9

27

23.506

3.494

0.519358

The value of the test statistic is equal to:

\[\begin{aligned}{c}{\chi ^2} = \sum {\frac{{{{\left( {O - E} \right)}^2}}}{E}} \\ = 0.021323 + 0.009741 + ... + 0.519358\\ = 1.762163\end{aligned}\]

Thus,\({\chi ^2} = 1.762\).

Let k be the number of digits, which are 9.

The degrees of freedom for\({\chi ^2}\)is computed below:

\(\begin{aligned}{c}df = k - 1\\ = 9 - 1\\ = 8\end{aligned}\)

05

State the conclusion

The critical value of\({\chi ^2}\)at\(\alpha = 0.05\)with 8 degrees of freedom is equal to 15.507, obtained using the chi-square table.

The p-value is equal to 0.987.

Since the test statistic value is less than the critical value and the p-value is greater than 0.05, the null hypothesis is failed to be rejected.

There is not enough evidence to conclude thatthe observed frequencies of the leading digits are not the same as the frequencies expected from Benford’s law.

Yes, it appears that the tax entries are legitimate.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

In Exercises 5–20, conduct the hypothesis test and provide the test statistic and the P-value and, or critical value, and state the conclusion.

Baseball Player Births In his book Outliers, author Malcolm Gladwell argues that more baseball players have birth dates in the months immediately following July 31, because that was the age cutoff date for nonschool baseball leagues. Here is a sample of frequency counts of months of birth dates of American-born Major League Baseball players starting with January: 387, 329, 366, 344, 336, 313, 313, 503, 421, 434, 398, 371. Using a 0.05 significance level, is there sufficient evidence to warrant rejection of the claim that American-born Major League Baseball players are born in different months with the same frequency? Do the sample values appear to support Gladwell’s claim?

One Big Bill or Many Smaller Bills In a study of the “denomination effect,” 150 women in China were given either a single 100 yuan bill or a total of 100 yuan in smaller bills. The value of 100 yuan is about $15. The women were given the choice of spending the money on specific items or keeping the money. The results are summarized in the table below (based on “The Denomination Effect,” by Priya Raghubir and Joydeep Srivastava, Journal of Consumer Research, Vol. 36). Use a 0.05 significance level to test the claim that the form of the 100 yuan is independent of whether the money was spent. What does the result suggest about a denomination effect?

Spent the Money

Kept the Money

Women Given a Single 100-Yuan Bill

60

15

Women Given 100 Yuan in Smaller Bills

68

7

Critical Thinking: Was Allstate wrong? The Allstate insurance company once issued a press release listing zodiac signs along with the corresponding numbers of automobile crashes, as shown in the first and last columns in the table below. In the original press release, Allstate included comments such as one stating that Virgos are worried and shy, and they were involved in 211,650 accidents, making them the worst offenders. Allstate quickly issued an apology and retraction. In a press release, Allstate included this: “Astrological signs have absolutely no role in how we base coverage and set rates. Rating by astrology would not be actuarially sound.”

Analyzing the Results The original Allstate press release did not include the lengths (days) of the different zodiac signs. The preceding table lists those lengths in the third column. A reasonable explanation for the different numbers of crashes is that they should be proportional to the lengths of the zodiac signs. For example, people are born under the Capricorn sign on 29 days out of the 365 days in the year, so they are expected to have 29/365 of the total number of crashes. Use the methods of this chapter to determine whether this appears to explain the results in the table. Write a brief report of your findings.

Zodiac sign

Dates

Length(days)

Crashes

Capricorn

Jan.18-Feb. 15

29

128,005

Aquarius

Feb.16-March 11

24

106,878

Pisces

March 12-April 16

36

172,030

Aries

April 17-May 13

27

112,402

Taurus

May 14-June 19

37

177,503

Gemini

June 20-July 20

31

136,904

Cancer

July21-Aug.9

20

101,539

Leo

Aug.10-Sep.15

37

179,657

Virgo

Sep.16-Oct.30

45

211,650

Libra

Oct.31-Nov 22

23

110,592

Scorpio

Nov. 23-Nov. 28

6

26,833

Ophiuchus

Nov.29-Dec.17

19

83,234

Sagittarius

Dec.18-Jan.17

31

154,477

In soccer, serious fouls in the penalty box result in a penalty kick withone kicker and one defending goalkeeper. The table below summarizes results from 286 kicksduring games among top teams (based on data from “Action Bias Among Elite Soccer Goalkeepers:

The Case of Penalty Kicks,” by Bar-Eli et al., Journal of Economic Psychology,Vol.28, No. 5). In the table, jump direction indicates which way the goalkeeper jumped, where thekick direction is from the perspective of the goalkeeper. Use a 0.05 significance level to test theclaim that the direction of the kick is independent of the direction of the goalkeeper jump. Dothe results support the theory that because the kicks are so fast, goalkeepers have no time toreact, so the directions of their jumps are independent of the directions of the kicks?

Goalkeeper Jump

Left

Center

Right

Kick to Left

54

1

37

Kick to Center

41

10

31

Kick to Right

46

7

59

In a clinical trial of the effectiveness of echinacea for preventing

colds, the results in the table below were obtained (based on data from “An Evaluation of Echinacea Angustifoliain Experimental Rhinovirus Infections,” by Turner et al., NewEngland Journal of Medicine,Vol. 353, No. 4). Use a 0.05 significance level to test the claim that getting a cold is independent of the treatment group. What do the results suggest about the

effectiveness of echinacea as a prevention against colds?

Treatment Group


Placebo

Echinacea:

20% Extract

Echinacea:

60% Extract

Got a Cold

88

48

42

Did Not Get a Cold

15

4

10

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free