Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Testing for a Linear Correlation. In Exercises 13–28, construct a scatterplot, and find the value of the linear correlation coefficient r. Also find the P-value or the critical values of r from Table A-6. Use a significance level of A = 0.05. Determine whether there is sufficient evidence to support a claim of a linear correlation between the two variables. (Save your work because the same data sets will be used in Section 10-2 exercises.)

Pizza and the Subway The “pizza connection” is the principle that the price of a slice of pizza in New York City is always about the same as the subway fare. Use the data listed below to determine whether there is a significant linear correlation between the cost of a slice of pizza and the subway fare.

Year

1960

1973

1986

1995

2002

2003

2009

2013

2015

Pizza Cost

0.15

0.35

1

1.25

1.75

2

2.25

2.3

2.75

Subway Fare

0.15

0.35

1

1.35

1.5

2

2.25

2.5

2.75

CPI

30.2

48.3

112.3

162.2

191.9

197.8

214.5

233

237.2

Short Answer

Expert verified

The scatter plot is shown below:

The value ofthe correlation coefficient is 0.992.

The p-value is 0.000.

There is enough evidence to support the claim that there is a linear correlation between the two variables(pizza cost and subway fare).

Step by step solution

01

Given information

Association between two variables, pizza cost and subway fare,isbeing studied.

Pizza Cost

Subway Fare

0.15

0.15

0.35

0.35

1

1

1.25

1.35

1.75

1.5

2

2

2.25

2.25

2.3

2.5

2.75

2.75

02

Sketch a scatterplot

A plot that shows observations from two variablesby scaling them on two axes is referred to as a scatterplot.

Steps to sketch a scatterplot:

  1. Mark horizontal axis for price cost and vertical axis for subway fare.
  2. Mark points for each paired value with respect to both axes.

The resultant graph is the required scatterplot.

03

Compute the measure of the correlation coefficient

The formula for the correlation coefficient is

\(r = \frac{{n\sum {xy} - \left( {\sum x } \right)\left( {\sum y } \right)}}{{\sqrt {n\left( {\sum {{x^2}} } \right) - {{\left( {\sum x } \right)}^2}} \sqrt {n\left( {\sum {{y^2}} } \right) - {{\left( {\sum y } \right)}^2}} }}\).

Let pizza cost be defined by variablex and subway fare be defined by variabley.

The valuesare listed in the table below:

x

y

\({x^2}\)

\({y^2}\)

\(xy\)

0.15

0.15

0.0225

0.0225

0.0225

0.35

0.35

0.1225

0.1225

0.1225

1

1

1

1

1

1.25

1.35

1.5625

1.8225

1.6875

1.75

1.5

3.0625

2.25

2.625

2

2

4

4

4

2.25

2.25

5.0625

5.0625

5.0625

2.3

2.5

5.29

6.25

5.75

2.75

2.75

7.5625

7.5625

7.5625

\(\sum x = 13.8\)

\(\sum y = 13.85\)

\(\sum {{x^2}} = 27.685\)

\(\sum {{y^2} = } 28.0925\)

\(\sum {xy\; = \;} 27.8325\)

Substitute the values in the formula:

\(\begin{aligned} r &= \frac{{9\left( {27.8325} \right) - \left( {13.8} \right)\left( {13.85} \right)}}{{\sqrt {9\left( {27.685} \right) - {{\left( {13.8} \right)}^2}} \sqrt {9\left( {28.0925} \right) - {{\left( {13.85} \right)}^2}} }}\\ &= 0.992\end{aligned}\)

Thus, the correlation coefficient is 0.992.

04

Step 4:Conduct a hypothesis test for correlation

Define\(\rho \)as the actual value of thecorrelation coefficient for pizza cost and subway fare.

For testing the claim, form the hypotheses:

\(\begin{array}{l}{{\rm{H}}_{\rm{o}}}:\rho = 0\\{{\rm{{\rm H}}}_{\rm{a}}}:\rho \ne 0\end{array}\)

The samplesize is 9 (n).

The test statistic is computed as follows:

\(\begin{aligned} t &= \frac{r}{{\sqrt {\frac{{1 - {r^2}}}{{n - 2}}} }}\\ &= \frac{{0.992}}{{\sqrt {\frac{{1 - {{0.992}^2}}}{{9 - 2}}} }}\\ &= 20.791\end{aligned}\)

Thus, the test statistic is 20.791.

The degree of freedom is

\(\begin{aligned} df &= n - 2\\ &= 9 - 2\\ &= 7.\end{aligned}\)

Thep-value is computed from the t-distribution table.

\(\begin{aligned} p{\rm{ - value}} &= 2P\left( {T > t} \right)\\ &= 2P\left( {T > 20.791} \right)\\ &= 2\left( {1 - P\left( {t < 20.791} \right)} \right)\\ &= 0.000\end{aligned}\)

Thus, the p-value is 0.000.

Since thep-value is less than 0.05, the null hypothesis is rejected.

Therefore, there is enough evidence to conclude that the variables pizza cost and subway fare have a linear correlation between them.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Interpreting r For the same two variables described in Exercise 1, if we find that r = 0, does that indicate that there is no association between those two variables?

Exercises 13–28 use the same data sets as Exercises 13–28 in Section 10-1. In each case, find the regression equation, letting the first variable be the predictor (x) variable. Find the indicated predicted value by following the prediction procedure summarized in Figure 10-5 on page 493.

Using the listed old/new mpg ratings, find the best predicted new

mpg rating for a car with an old rating of 30 mpg. Is there anything to suggest that the prediction is likely to be quite good?

Critical Thinking: Is the pain medicine Duragesic effective in reducing pain? Listed below are measures of pain intensity before and after using the drug Duragesic (fentanyl) (based on data from Janssen Pharmaceutical Products, L.P.). The data are listed in order by row, and corresponding measures are from the same subject before and after treatment. For example, the first subject had a measure of 1.2 before treatment and a measure of 0.4 after treatment. Each pair of measurements is from one subject, and the intensity of pain was measured using the standard visual analog score. A higher score corresponds to higher pain intensity.

Pain Intensity Before Duragesic Treatment

1.2

1.3

1.5

1.6

8

3.4

3.5

2.8

2.6

2.2

3

7.1

2.3

2.1

3.4

6.4

5

4.2

2.8

3.9

5.2

6.9

6.9

5

5.5

6

5.5

8.6

9.4

10

7.6










Pain Intensity After Duragesic Treatment

0.4

1.4

1.8

2.9

6

1.4

0.7

3.9

0.9

1.8

0.9

9.3

8

6.8

2.3

0.4

0.7

1.2

4.5

2

1.6

2

2

6.8

6.6

4.1

4.6

2.9

5.4

4.8

4.1










Two Independent Samples The methods of Section 9-2 can be used to test the claim that two populations have the same mean. Identify the specific claim that the treatment is effective, then use the methods of Section 9-2 to test that claim. The methods of Section 9-2 are based on the requirement that the samples are independent. Are they independent in this case?

The following exercises are based on the following sample data consisting of numbers of enrolled students (in thousands) and numbers of burglaries for randomly selected large colleges in a recent year (based on data from the New York Times).

Conclusion The linear correlation coefficient r is found to be 0.499, the P-value is 0.393, and the critical values for a 0.05 significance level are\( \pm 0.878\). What should you conclude?

The following exercises are based on the following sample data consisting of numbers of enrolled students (in thousands) and numbers of burglaries for randomly selected large colleges in a recent year (based on data from the New York Times).

Repeat the preceding exercise, assuming that the linear correlation coefficient is r= 0.997.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free