Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

CSI Statistics Police sometimes measure shoe prints at crime scenes so that they can learn something about criminals. Listed below are shoe print lengths, foot lengths, and heights of males (from Data Set 2 “Foot and Height” in Appendix B). Is there sufficient evidence to conclude that there is a linear correlation between shoe print lengths and heights of males? Based on these results, does it appear that police can use a shoe print length to estimate the height of a male?

Shoe print(cm)

29.7

29.7

31.4

31.8

27.6

Foot length(cm)

25.7

25.4

27.9

26.7

25.1

Height (cm)

175.3

177.8

185.4

175.3

172.7

Short Answer

Expert verified

The scatterplot is shown below:

The value of the correlation coefficient is 0.591.

The p-value is 0.294.

There is not enough evidence to support the claim that there is a linear correlation between the two variables.

As the scatterplot only reveals an upward trend and no specific association, the shoe print length cannot be used for estimating the height ofmales.

Step by step solution

01

Given information

The data for shoe print lengths, foot lengths, and heights of males is recorded.

Shoe print(cm)

29.7

29.7

31.4

31.8

27.6

Foot length(cm)

25.7

25.4

27.9

26.7

25.1

Height (cm)

175.3

177.8

185.4

175.3

172.7

02

Sketch a scatterplot

A two-dimensional plot based on a paired data plotted with reference to two axes is called a scatterplot.

Steps to sketch a scatterplot:

  1. Mark horizontal axis for shoe print lengthand vertical axis for the height of males.
  2. Mark points for the paired observations corresponding to both axes.

The resultant graph is the scatterplot.

03

Compute the measure of the correlation coefficient

The formula for the correlation coefficient is

\(r = \frac{{n\sum {xy} - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \sqrt {n\left( {\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} }}\).

Let shoe print be defined by variable x and the height of males be defined by variable y.

The valuesare listed in the table below:

x

y

\({x^2}\)

\({y^2}\)

\(xy\)

29.7

175.3

882.09

30730.09

5206.41

29.7

177.8

882.09

31612.84

5280.66

31.4

185.4

985.96

34373.16

5821.56

31.8

175.3

1011.24

30730.09

5574.54

27.6

172.7

761.76

29825.29

4766.52

\(\sum x = 150.2\)

\(\sum y = 886.5\)

\(\sum {{x^2}} = 4523.14\)

\(\sum {{y^2} = } \;157271.5\)

\(\sum {xy\; = \;} 26649.69\)

Substitute the values in the formula:

\(\begin{aligned} r &= \frac{{5\left( {26649.69} \right) - \left( {150.2} \right)\left( {886.5} \right)}}{{\sqrt {5\left( {157271.5} \right) - {{\left( {150.2} \right)}^2}} \sqrt {5\left( {4523.14} \right) - {{\left( {886.5} \right)}^2}} }}\\ &= 0.591\end{aligned}\)

Thus, the correlation coefficient is 0.591.

04

Step 4:Conduct a hypothesis test for correlation

Define\(\rho \)as the actual value of the correlation coefficient for shoe print length and the height of males.

For testing the claim, form the hypotheses:

\(\begin{array}{l}{{\rm{H}}_{\rm{o}}}:\rho = 0\\{{\rm{{\rm H}}}_{\rm{a}}}:\rho \ne 0\end{array}\)

The samplesize is 5 (n).

The test statistic is computed as follows:

\(\begin{aligned} t &= \frac{r}{{\sqrt {\frac{{1 - {r^2}}}{{n - 2}}} }}\\ = \frac{{0.591}}{{\sqrt {\frac{{1 - {{0.591}^2}}}{{5 - 2}}} }}\\ &= 1.27\end{aligned}\)

Thus, the test statistic is 1.270.

The degree of freedom is

\(\begin{aligned} df &= n - 2\\ &= 5 - 2\\ &= 3.\end{aligned}\)

Thep-value is computed from the t-distribution table.

\(\begin{aligned} p{\rm{ - value}} &= 2P\left( {T > t} \right)\\ &= 2P\left( {T > 1.27} \right)\\ &= 2\left( {1 - P\left( {T < 1.27} \right)} \right)\\ &= 0.294\end{aligned}\)

Thus, the p-value is 0.294.

Since the p-value is greater than 0.05, the null hypothesis fails to berejected.

Therefore, there is not enough evidence to conclude that the variables shoe print length and height have a linear correlation between them.

05

Analyze if the shoe print length can help predict the height of males

Since the two variables are not linearly correlated to one other, one variable cannot be used to estimate the other.

Also, the scatterplot reveals no specific pattern between shoe print length and height of men (linear or non-linear).Thus, the two variables are not associated with one other.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

The following exercises are based on the following sample data consisting of numbers of enrolled students (in thousands) and numbers of burglaries for randomly selected large colleges in a recent year (based on data from the New York Times).

If you had computed the value of the linear correlation coefficient to be 1.500, what should you conclude?

Stocks and Sunspots. Listed below are annual high values of the Dow Jones Industrial Average (DJIA) and annual mean sunspot numbers for eight recent years. Use the data for Exercises 1–5. A sunspot number is a measure of sunspots or groups of sunspots on the surface of the sun. The DJIA is a commonly used index that is a weighted mean calculated from different stock values.

DJIA

14,198

13,338

10,606

11,625

12,929

13,589

16,577

18,054

Sunspot Number

7.5

2.9

3.1

16.5

55.7

57.6

64.7

79.3

z Scores Using only the sunspot numbers, identify the highest number and convert it to a z score. In the context of these sample data, is that highest value “significantly high”? Why or why not?

Critical Thinking: Is the pain medicine Duragesic effective in reducing pain? Listed below are measures of pain intensity before and after using the drug Duragesic (fontanels) (based on data from Janssen Pharmaceutical Products, L.P.). The data are listed in order by row, and corresponding measures are from the same subject before and after treatment. For example, the first subject had a measure of 1.2 before treatment and a measure of 0.4 after treatment. Each pair of measurements is from one subject, and the intensity of pain was measured using the standard visual analog score. A higher score corresponds to higher pain intensity.

Pain intensity before Duragestic Treatment

1.2

1.3

1.5

1.6

8

3.4

3.5

2.8

2.6

2.2

3

7.1

2.3

2.1

3.4

6.4

5

4.2

2.8

3.9

5.2

6.9

6.9

5

5.5

6

5.5

8.6

9.4

10

7.6

Pain intensity after Duragestic Treatment

0.4

1.4

1.8

2.9

6.0

1.4

0.7

3.9

0.9

1.8

0.9

9.3

8.0

6.8

2.3

0.4

0.7

1.2

4.5

2.0

1.6

2.0

2.0

6.8

6.6

4.1

4.6

2.9

5.4

4.8

4.1

Regression:Use the given data to find the equation of the regression line. Let the response (y) variable be the pain intensity after treatment. What would be the equation of the regression line for a treatment having absolutely no effect?

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

CPI and the Subway Use CPI>subway data from the preceding exercise to determine whether there is a significant linear correlation between the CPI (Consumer Price Index) and the subway fare.

Interpreting the Coefficient of Determination. In Exercises 5–8, use the value of the linear correlation coefficient r to find the coefficient of determination and the percentage of the total variation that can be explained by the linear relationship between the two variables.

Bears r = 0.783 (x = head width of a bear, y = weight of a bear)

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free