Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Some commercial airplanes recirculate approximately \(50 \%\) of the cabin air in order to increase fuel efficiency. The authors of the paper "Aircraft Cabin Air Recirculation and Symptoms of the Common Cold" (Journal of the American Medical Association [2002]: \(483-486\) ) studied 1100 airline passengers who flew from San Francisco to Denver between January and April 1999\. Some passengers traveled on airplanes that recirculated air and others traveled on planes that did not recirculate air. Of the 517 passengers who flew on planes that did not recirculate air, 108 reported post-flight respiratory symptoms, while 111 of the 583 passengers on planes that did recirculate air reported such symptoms. Is there sufficient evidence to conclude that the proportion of passengers with post-flight respiratory symptoms differs for planes that do and do not recirculate air? Test the appropriate hypotheses using \(\alpha=.05\). You may assume that it is reasonable to regard these two samples as being independently selected and as representative of the two populations of interest.

Short Answer

Expert verified
The short answer would be determined based on the p-value and the level of significance. If the p-value is less than .05, there is enough evidence to reject the null hypothesis and conclude that the proportion of passengers with post-flight respiratory symptoms differs for planes that do and do not recirculate air. If the p-value is greater than .05, there is not enough evidence to reject the null hypothesis, meaning we can't conclude that there is a difference in the proportions.

Step by step solution

01

State the hypotheses

The null hypothesis (\(H_0\)): The proportion of passengers with post-flight respiratory symptoms on planes that do and don't recirculate air is the same. The alternative hypothesis (\(H_1\)): The two proportions are not equal.
02

Calculate sample proportions

Calculate the proportion of passengers with post-flight respiratory symptoms for each group. For the non-recirculating air group, the proportion (\(p_1\)) is 108/517. For the recirculating air group, the proportion (\(p_2\)) is 111/583.
03

Calculate the standard error

Calculate the standard error using the formula \(SE = \sqrt{p(1 - p)(1/n_1 + 1/n_2)}\) where \(p\) is the pooled sample proportion, calculated as \((x_1 + x_2) / (n_1 + n_2)\) (x - number of successes in a sample, n - sample size).
04

Compute the test statistic

Calculate the test statistic using the formula \(Z = (p_1 - p_2) / SE\).
05

Calculate the p-value

The p-value is the probability of observing a sample statistic as extreme as the test statistic. Since the alternative hypothesis is not directional (it doesn't state which proportion is larger), use the two-tail approach for calculating the p-value. Refer to the standard normal distribution table to find the probability associated with the calculated Z.
06

Conclude the hypothesis test

If the p-value is less than the level of significance (\(\alpha = .05\)), reject the null hypothesis. If the p-value is greater than \(\alpha\), do not reject the null hypothesis. Based on this decision, conclude if there is enough evidence to say that the proportion of passengers with post-flight respiratory symptoms differs for planes that do and do not recirculate air.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Null Hypothesis
The null hypothesis, often symbolized as \(H_0\), is a statement that suggests there is no effect or no difference in a given context. In hypothesis testing, it's the claim that there's no change or that a certain parameter equals a specific value.
In the context of airplane cabin air recirculation, and its connection with post-flight respiratory symptoms, our null hypothesis posits that the proportion of travelers experiencing symptoms is the same, whether or not the aircraft recirculates air.
This hypothesis serves as the starting point for our analysis. It's the idea we aim to test against, using statistical methods, to determine if observed data like collected samples significantly differ, warranting a rejection of this hypothesis.
Alternative Hypothesis
The alternative hypothesis, represented as \(H_1\), suggests that there is an effect or a difference. It's what researchers hope to support through their analysis.
For the airline study, the alternative hypothesis proposes that the proportion of passengers with respiratory symptoms does vary between those on aircrafts with recirculated air versus those without. By identifying this hypothesis, researchers set out to find if the proportions differ, indicating that air recirculation could influence symptom incidence.
The alternative hypothesis is pivotal, as it gives direction to the test, either affirming a change or highlighting a difference when compared to the null hypothesis.
Standard Error
The standard error (SE) measures the accuracy with which a sample represents a population. It is calculated using sample proportions and provides insight into the variability of the data.
In the context of our exercise, we compute the standard error to assess how much the sample proportions of respiratory symptoms among passengers could differ due to random sampling.
The formula for the standard error in comparing two proportions is: \[ SE = \sqrt{p(1 - p)(\frac{1}{n_1} + \frac{1}{n_2})} \] where \( p \) is the pooled sample proportion, \( n_1 \) is the sample size for non-recirculating planes, and \( n_2 \) for recirculating planes.
This computation is crucial for determining if observed differences are statistically significant.
P-Value
The p-value is a probability measure that helps determine the significance of results obtained from a statistical hypothesis test.
A low p-value indicates that the observed data is highly unlikely under the assumption that the null hypothesis is true. Conversely, a high p-value suggests that observed data is plausible when the null hypothesis holds.
In our scenario, we conduct a two-tailed test because our alternative hypothesis only states that the proportions are not equal, without specifying which is greater. We then calculate the p-value using the standard normal distribution table and our test statistic \( Z \).
This value will tell us if the deviations we observe in our test are just due to random chance or if they suggest real differences between the groups.
Z-Test
The Z-test is a statistical method used to determine if there is a significant difference between sample and population means, or among two sample means. It's appropriate when data follows a normal distribution, and we know the population variance. In our case, we compare proportions.
The Z-test formula used to find the test statistic is: \[ Z = \frac{(p_1 - p_2)}{SE} \] where \( p_1 \) and \( p_2 \) are the respective sample proportions of passengers experiencing symptoms in the respective groups, and \( SE \) is the computed standard error.
Once calculated, this Z-value indicates how far our sample's observation deviates from the null hypothesis. We use this in conjunction with the p-value to decide whether to reject the null hypothesis, providing insights into our original research question.

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

The article "Portable MP3 Player Ownership Reaches New High" (Ipsos Insight, June 29,2006 ) reported that in \(2006,20 \%\) of those in a random sample of 1112 Americans age 12 and older indicated that they owned an MP3 player. In a similar survey conducted in \(2005,\) only \(15 \%\) reported owning an \(\mathrm{MP} 3\) player. Suppose that the 2005 figure was also based on a random sample of size \(1112 .\) Estimate the difference in the proportion of Americans age 12 and older who owned an MP3 player in 2006 and the corresponding proportion for 2005 using a \(95 \%\) confidence interval. Is zero included in the interval? What does this tell you about the change in this proportion from 2005 to \(2006 ?\)

The paper "The Truth About Lying in Online Dating Profiles" (Proceedings, Computer-Human Interactions [2007]\(: 1-4)\) describes an investigation in which 40 men and 40 women with online dating profiles agreed to participate in a study. Each participant's height (in inches) was measured and the actual height was compared to the height given in that person's online profile. The differences between the online profile height and the actual height (profile - actual) were used to compute the values in the accompanying table. $$ \begin{array}{ll} \text { Men } & \text { Women } \\ \hline \bar{x}_{d}=0.57 & \bar{x}_{d}=0.03 \\ s_{d}=0.81 & s_{d}=0.75 \\ n=40 & n=40 \end{array} $$ For purposes of this exercise, assume it is reasonable to regard the two samples in this study as being representative of male online daters and female online daters. (Although the authors of the paper believed that their samples were representative of these populations, participants were volunteers recruited through newspaper advertisements, so we should be a bit hesitant to generalize results to all online daters!) a. Use the paired \(t\) test to determine if there is convincing evidence that, on average, male online daters overstate their height in online dating profiles. Use \(\alpha=.05\) b. Construct and interpret a \(95 \%\) confidence interval for the difference between the mean online dating profile height and mean actual height for female online daters. c. Use the two-sample \(t\) test of Section 11.1 to test \(H_{0}: \mu_{m}-\mu_{f}=0\) versus \(H_{a}: \mu_{m}-\mu_{f}>0,\) where \(\mu_{m}\) is the mean height difference (profile - actual) for male online daters and \(\mu_{f}\) is the mean height difference (profile - actual) for female online daters. d. Explain why a paired \(t\) test was used in Part (a) but a two-sample \(t\) test was used in Part (c).

Suppose that you were interested in investigating the effect of a drug that is to be used in the treatment of patients who have glaucoma in both eyes. A comparison between the mean reduction in eye pressure for this drug and for a standard treatment is desired. Both treatments are applied directly to the eye. a. Describe how you would go about collecting data for your investigation. b. Does your method result in paired data? c. Can you think of a reasonable method of collecting data that would not result in paired samples? Would such an experiment be as informative as a paired experiment? Comment.

Do girls think they don't need to take as many science classes as boys? The article "Intentions of Young Students to Enroll in Science Courses in the Future: An Examination of Gender Differences" (Science Education [1999]: 55-76) gives information from a survey of children in grades \(4,5,\) and \(6 .\) The 224 girls participating in the survey each indicated the number of science courses they intended to take in the future, and they also indicated the number of science courses they thought boys their age should take in the future. For each girl, the authors calculated the difference between the number of science classes she intends to take and the number she thinks boys should take. a. Explain why these data are paired. b. The mean of the differences was -.83 (indicating girls intended, on average, to take fewer classes than they thought boys should take), and the standard deviation was 1.51. Construct and interpret a \(95 \%\) confidence interval for the mean difference.

The authors of the paper "Adolescents and MP3 Players: Too Many Risks, Too Few Precautions" \((P \mathrm{e}-\) diatrics [2009]: e953-e958) concluded that more boys than girls listen to music at high volumes. This conclusion was based on data from independent random samples of 764 Dutch boys and 748 Dutch girls age 12 to \(19 .\) Of the boys, 397 reported that they almost always listen to music at a high volume setting. Of the girls, 331 reported listening to music at a high volume setting. Do the sample data support the authors' conclusion that the proportion of Dutch boys who listen to music at high volume is greater than this proportion for Dutch girls? Test the relevant hypotheses using a .01 significance level.

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free