Chapter 10: Comparing Two Populations or Groups
Q. 32
Starting in the , medical technology allowed babies with very low birth weight (VLBW, less than grams, or about pounds) to survive without major handicaps. It was noticed that these children nonetheless had difficulties in school and as adults. A long study has followed randomly selected VLBW babies to age years, along with a control group of randomly selected babies from the same population who had normal birth weight.
(a) Is this an experiment or an observational study? Why?
(b) At age of the VLBW group and of the control group had graduated from high school. Is the graduation rate among the VLBW group significantly lower than for the normal-birth-weight controls? Give appropriate statistical evidence to justify your answer.
Q.3.2
To study the long-term effects of preschool programs for poor children, researchers designed an experiment. They recruited children who had never attended preschool from low-income families in Michigan. Researchers randomly assigned of the children to attend preschool (paid for by the study budget) and the other 61 to serve as a control group who would not go to preschool. One response variable of interest was the need for social services as adults. In the past years, children in the preschool group and in the control group have needed social services.
The Minitab output below gives a % confidence interval for . Interpret this interval in context. Then explain what additional information the confidence interval provides.
Q. 33
A nuclear power plant releases water into a nearby lake every afternoon at p.m. Environmental researchers are concerned that fish are being driven away from the area around the plant. They believe that the temperature of the water discharged may be a factor. The scatterplot below shows the temperature of the water (°C) released by the plant and the measured distance (in meters) from the outflow pipe of the plant to the nearest fish found in the water on eight randomly chosen afternoons.
Computer output from a least-squares regression analysis on these data is shown below.
(a) Write the equation of the least-squares regression line. Define any variables you use.
(b) Interpret the slope of the regression line in context.
(c) A residual plot for the regression is shown on the next page. Is a linear model appropriate for describing the relationship between temperature and distance to the nearest fish? Justify your answer.
(d) Does the linear model in part (a) overpredict or underpredict the measured distance from the outflow pipe to the nearest fish found in the water for a temperature of ? Explain your reasoning.
Q. 33
Thirty randomly selected seniors at Council High School were asked to report the age (in years) and mileage of their main vehicles. Here is a scatterplot of the data:
(a) What is the equation of the least-squares regression line? Be sure to define any symbols you use.
(b) Interpret the slope of the least-squares line in the context of this problem. (c) One student reported that her -year-old car had miles on it. Find the residual for this data value. Show your work
Q.34
Drive my car (3.2, 4.3)
(a) Explain what the value of tells you about how well the least-squares line fits the data.
(b) The mean age of the students’ cars in the sample was years. Find the mean mileage of the cars in the sample. Show your work.
(c) Interpret the value of in the context of this setting.
(d) Would it be reasonable to use the least-squares line to predict a car’s mileage from its age for a Council High School teacher? Justify your answer
Q. 34
The Candy Shoppe assembles gift boxes that contain 8 chocolate truffles and two handmade caramel nougats. The truffles have a mean weight of 2 ounces with a standard deviation of 0.5 ounce, and the nougats have a mean weight of 4 ounces with a standard deviation of 1 ounce. The empty boxes weigh 3 ounces with a standard deviation of 0.2 ounce.
(a) Assuming that the weights of the truffles, nougats, and boxes are independent, what are the mean and standard deviation of the weight of a box of candy?
(b) Assuming that the weights of the truffles, nougats, and boxes are approximately Normally distributed, what is the probability that a randomly selected box of candy will weigh more than 30 ounces?
(c) If five gift boxes are randomly selected, what is the probability that at least one of them will weigh more than 30 ounces?
(d) If five gift boxes are randomly selected, what is the probability that the mean weight of the five boxes will be more than 30 ounces?
Q. 35
An investor is comparing two stocks, A and B. She wants to know if over the long run, there is a significant difference in the return on investment as measured by the percent increase or decrease in the price of the stock from its date of purchase. The investor takes a random sample of 50 daily returns over the past five years for each stock. The data are summarized below.
(a) Is there a significant difference in the mean return on investment for the two stocks? Support your answer with appropriate statistical evidence. Use a 5% significance level.
(b) The investor believes that although the return on investment for Stock A usually exceeds that of Stock B, Stock A represents a riskier investment, where the risk is measured by the price volatility of the stock. The standard deviation is a statistical measure of the price volatility and indicates how much an investment’s actual performance during a specified period varies from its average performance over a longer period. Do the price fluctuations in Stock A significantly exceed those of Stock B, as measured by their standard deviations? Identify an appropriate set of hypotheses that the investor is interested in testing.
(c) To measure this, we will construct a test statistic defined
What value(s) of the statistic would indicate that the price fluctuations in Stock A significantly exceed those of Stock B? Explain.
(d) Calculate the value of the F statistic using the information given in the table.
(e) Two hundred simulated values of this test statistic, F, were calculated assuming no difference in the standard deviations of the returns for the two stocks. The results of the simulation are displayed in the following dot plot.
Use these simulated values and the test statistic that you calculated in part (d) to determine whether the observed data provide evidence that Stock A is a riskier investment than Stock B. Explain your reasoning .
Q. 35
The level of cholesterol in the blood for all men aged to follows a Normal distribution with mean milligrams per deciliter (mg/dl) and a standard deviation mg/dl. For -year-old boys, blood cholesterol levels follow a Normal distribution with a mean mg/dl and a standard deviation of mg/dl.
(a) Let M the cholesterol level of a randomly selected to -year-old man and B =the cholesterol level of a randomly selected -year-old boy. Describe the shape, center, and spread of the distribution of
(b) Find the probability that a randomly selected -year-old boy has higher cholesterol than a randomly selected man aged to. Show your work.
Q. 36
The heights of young men follow a Normal distribution with mean of inches and standard deviation inches. The heights of young women follow a Normal distribution with mean inches and standard deviation inches.
(a) Let M the height of a randomly selected young man and W the height of a randomly selected young woman. Describe the shape, center, and spread of the distribution of
(b) Find the probability that a randomly selected young man is at least inches taller than a randomly selected young woman. Show your work.
Q. 37
Refer exercise Suppose we select independent SRSs of men aged to and boys aged and calculate the sample mean heights and .
(a) Describe the shape, center, and spread of the sampling distribution of.
(b) Find the probability of getting a difference in sample means that’s less thanmg/dl. Show your work.
(c) Should we be surprised if the sample mean cholesterol level for the -year-old boys exceeds the sample mean cholesterol level for the men? Explain.