Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

According to the article "Modeling and Predicting the Effects of Submerged Arc Weldment Process Parameters on Weldment Characteristics and Shape Profiles" (J. of Engr. Manuf., \(2012: 1230 - 1240\)), the submerged arc welding (SAW) process is commonly used for joining thick plates and pipes. The heat affected zone (HAZ), a band created within the base metal during welding, was of particular interest to the investigators. Here are observations on depth\((mm)\)of the HAZ both when the current setting was high and when it was lower.

\(\begin{array}{*{20}{l}}{ Non - high }&{1.04}&{1.15}&{1.23}&{1.69}&{1.92}\\{}&{1.98}&{2.36}&{2.49}&{2.72}&{}\\{}&{1.37}&{1.43}&{1.57}&{1.71}&{1.94}\\{ High }&{2.06}&{2.55}&{2.64}&{2.82}&{}\\{}&{1.55}&{2.02}&{2.02}&{2.05}&{2.35}\\{}&{2.57}&{2.93}&{2.94}&{2.97}&{}\end{array}\)

a. Construct a comparative boxplot and comment on interesting features.

b. Is it reasonable to use the two-sample \(t\) test to test hypotheses about the difference between true average HAZ depths for the two conditions?

c. Does it appear that true average HAZ depth is larger for the higher current condition than for the lower condition? Carry out a test of appropriate hypotheses using a significance level of . \(01\).

Short Answer

Expert verified

a)Non-high: Roughly symmetric High: Strongly skewed.

b) Reasonable

C) There is not sufficient evidence to support the claim that the true average HAZ depth is larger for the higher current condition than for the lower condition.

Step by step solution

01

a)Step 1: To find the quartile for Non high values

Given:

Non-high (sorted):\(1.04, 1.15, 1.23, 1.37, 1.43, 1.57, 1.69, 1.71, 1.92, 1.94, 1.98, 2.06, 2.36, 2.49, 2.55, 2.64, 2.72, 2.82\)

High: \(\;1.55,{\rm{ }}2.02,{\rm{ }}2.02,{\rm{ }}2.05,{\rm{ }}2.35,{\rm{ }}2.57,{\rm{ }}2.93,{\rm{ }}2.94,{\rm{ }}2.97\)

NON HIGH

The minimum is\(1.04\).

Since the number of data values is even, the median is the average of the two middle values of the sorted data set:

\(M = {Q_2} = \frac{{19.2 + 19.4}}{2} = 1.93\)

The first quartile is the median of the data values below the median (or at\(25\% \)of the data):

\({Q_1} = 1.43\)

The third quartile is the median of the data values above the median (or at\(75\% \)of the data):

\({Q_3} = 2.49\)

The maximum is \(2.82.\)

02

To find the quartile for High values

HIGH

The minimum is\(1.55.\)

Since the number of data values is odd, the median is the middle value of the sorted data set:

\(M = {Q_2} = 2.35\)

The first quartile is the median of the data values below the median (or at\(25\% \)of the data):

\({Q_1} = \frac{{2.02 + 2.02}}{2} = 2.02\)

The third quartile is the median of the data values above the median (or at\(75\% \)of the data):

\({Q_3} = \frac{{2.93 + 2.94}}{2} = 2.935\)

The maximum is \(2.97\).

03

Plot the graph

BOXPLOT

The whiskers of the boxplot are at the minimum and maximum value. The box starts at the first quartile, ends at the third quartile and has a vertical line at the median.

The first quartile is at \(25\% \)of the sorted data list, the median at \(50\% \)and the third quartile at \(75\% .\)

Because the box of the boxplot lies to the right in between the whiskers, the distribution of "High" is severely skewed.

Because the box of the boxplot lies nearly in the middle between the whiskers and the vertical line of the median in the box of the boxplot lies roughly in the middle of the box of the boxplot, the distribution of "Non-high" is roughly symmetric.

04

b)Step 4: To find the quartile for Non high values

Given:

Non-high (sorted):\(1.04,1.15,1.23,1.37,1.43,1.57,1.69,1.71,1.92,1.94,1.98,2.06,2.36,2.49,2.55,2.64,2.72,2.82\)

High: \(1.55,2.02,2.02,2.05,2.35,2.57,2.93,2.94,2.97\)

If we want to perform a two-sample\(t\)test, then we require that both sampling distributions of the sample mean are approximately normal.

NON HIGH

We will create a normal probability plot.

The data values are on the horizontal axis and the standardized normal scores are on the vertical axis.

If the data contains \(n\)data values, then the standardized normal scores are the z-scores in the normal probability table of the appendix corresponding to an area of\(\frac{{j - 0.5}}{n}\)(or the closest area) with\(j \in \{ 1,2,3, \ldots .,n\} \).

The smallest standardized score corresponds with the smallest data value, the second smallest standardized score corresponds with the second smallest data value, and so on.

05

To find the quartile for Non high values

HIGH

We will create a normal probability plot.

The data values are on the horizontal axis and the standardized normal scores are on the vertical axis.

If the data contains\(n\)data values, then the standardized normal scores are the z-scores in the normal probability table of the appendix corresponding to an area of\(\frac{{j - 0.5}}{n}\)(or the closest area) with\(j \in \{ 1,2,3, \ldots ,n\} \).

The smallest standardized score corresponds with the smallest data value, the second smallest standardized score corresponds with the second smallest data value, and so on.

If the pattern in the normal probability plot is roughly linear and does not contain strong The population distribution is approximated normal if the pattern in the normal probability plot is generally linear and does not contain substantial curvature.

Because neither probability map has a lot of curvature and is roughly linear, both population distributions are roughly normal.

The sampling distribution of the sample mean(s) \(\bar x\) is similarly nearly normal since the population distributions are approximately normal. As a result, the two-sample test is suitable.

06

c)Step 6: Determine the standard deviation

Given:

\(\begin{array}{l}{n_1} = 18\\{n_2} = 9\\\alpha = 0.01\end{array}\)

The mean is the sum of all values divided by the number of values:

\(\begin{array}{l}{{\bar x}_1} = \frac{{1.04 + 1.15 + 1.23 + \ldots + 2.55 + 2.64 + 2.82}}{{18}} \approx 1.9261\\{{\bar x}_2} = \frac{{1.55 + 2.02 + 2.02 + \ldots + 2.93 + 2.94 + 2.97}}{9} \approx 2.3778\end{array}\)

The variance is the sum of squared deviations from the mean divided by\(n - 1\). The standard deviation is the square root of the variance:

\(\begin{array}{l}{s_1} = \sqrt {\frac{{{{(1.04 - 1.9261)}^2} + \ldots . + {{(2.82 - 1.9261)}^2}}}{{18 - 1}}} \approx 0.5694\\{s_2} = \sqrt {\frac{{{{(1.55 - 2.3778)}^2} + \ldots . + {{(2.97 - 2.3778)}^2}}}{{9 - 1}}} \approx 0.5072\end{array}\)

Given claim: larger the the higher current condition.

The claim is either the null hypothesis or the alternative hypothesis. The null hypothesis and the alternative hypothesis state the opposite of each other. The null hypothesis needs to contain the value mentioned in the claim.

\(\begin{array}{l}{H_0}:{\mu _1} = {\mu _2}\\{H_a}:{\mu _1} < {\mu _2}\end{array}\)

07

Find the test statistic

Determine the test statistic:

\(t = \frac{{{{\bar x}_1} - {{\bar x}_2}}}{{\sqrt {\frac{{s_1^2}}{{{n_1}}} + \frac{{s_2^2}}{{{n_2}}}} }} = \frac{{1.9261 - 2.3778}}{{\sqrt {\frac{{{{0.5694}^2}}}{{18}} + \frac{{{{0.5072}^2}}}{9}} }} \approx - 2.093\)

Determine the degrees of freedom (rounded down to the nearest integer):

\(\Delta = \frac{{{{\left( {\frac{{s_1^2}}{{{n_1}}} + \frac{{s_2^2}}{{{n_2}}}} \right)}^2}}}{{\frac{{{{\left( {s_1^2/{n_1}} \right)}^2}}}{{{n_1} - 1}} + \frac{{{{\left( {s_2^2/{n_2}} \right)}^2}}}{{{n_2} - 1}}}} = \frac{{{{\left( {\frac{{{{0.5694}^2}}}{{18}} + \frac{{{{0.5072}^2}}}{9}} \right)}^2}}}{{\frac{{{{\left( {{{0.5694}^2}/18} \right)}^2}}}{{18 - 1}} + \frac{{{{\left( {{{0.5072}^2}/9} \right)}^2}}}{{9 - 1}}}} \approx 17\)

The P-value is the probability of obtaining the value of the test statistic, or a value more extreme. The P-value is the number (or interval) in the column title of Student's T distribution in the appendix containing the t-value in the row\(df = 17\):

\(0.025 < P < 0.05\)

If the P-value is less than or equal to the significance level, then the null hypothesis is rejected:

\(P > 0.01 \Rightarrow {\rm{ Fail to reject }}{H_0}\)

There is not sufficient evidence to support the claim that the true average HAZ depth is larger for the higher current condition than for the lower condition.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Damage to grapes from bird predation is a serious problem for grape growers. The article "'Experimental Method to Investigate and Monitor Bird Behavior and Damage to Vineyards" (Amer. \(J\). of Enology and Viticulture, 2004: 288-291) reported on an experiment involving a bird-feeder table, time-lapse video, and artificial foods. Information was collected for two different bird species at both the experimental location and at a natural vineyard setting. Consider the following data on time (sec) spent on a single visit to the location.

\(\begin{array}{l}Species Location n \overline x SEmean\\Blackbirds Exptl 65 13.4 2.05 \\Blackbirds Natural 50 9.7 1.76\\Silvereyes Exptl 34 49.4 4.78\\Silvereyes Natural 46 38.4 5.06\end{array}\)

a. Calculate an upper confidence bound for the true average time that blackbirds spend on a single visit at the experimental location.

b. Does it appear that true average time spent by blackbirds at the experimental location exceeds the true average time birds of this type spend at the natural location? Carry out a test of appropriate hypotheses.

c. Estimate the difference between the true average time blackbirds spend at the natural location and true average time that silvereyes spend at the natural

Suppose a level .05 test of \({H_0}:{\mu _1} - {\mu _2} = 0\) versus \({H_a}:{\mu _1} - {\mu _2} > 0\) is to be performed, assuming \((m = n)\)\({\sigma _1} = {\sigma _2} = \)10 and normality of both distributions, using equal sample sizes. Evaluate the probability of a type II error when \({\mu _1} - {\mu _2} = 1\) and\(n = 25,100,2500\), and 10,000. Can you think of real problems in which the difference \({\mu _1} - {\mu _2} = 1\)has little practical significance? Would sample sizes of n=10,000 be desirable in such problems?

Torsion during hip external rotation (ER) and extension may be responsible for certain kinds of injuries in golfers and other athletes. The article "Hip Rotational Velocities During the Full Golf Swing" (J. of Sports Science and Medicine, 2009: 296-299) reported on a study in which peak ER velocity and peak IR (internal rotation) velocity (both in deg.sec\(^{ - 1}\)) were determined for a sample of 15 female collegiate golfers during their swings. The following data was supplied by the article's authors.

\(\begin{aligned}{*{20}{r}}{ Golfer }&{ \backslash multicolumn1c ER }&{ IR }&{ \backslash multicolumn1c diff }&{z perc }\\1&{ - 130.6}&{ - 98.9}&{ - 31.7}&{ - 1.28}\\2&{ - 125.1}&{ - 115.9}&{ - 9.2}&{ - 0.97}\\3&{ - 51.7}&{ - 161.6}&{109.9}&{0.34}\\4&{ - 179.7}&{ - 196.9}&{17.2}&{ - 0.73}\\5&{ - 130.5}&{ - 170.7}&{40.2}&{ - 0.34}\\6&{ - 101.0}&{ - 274.9}&{173.9}&{0.97}\\7&{ - 24.4}&{ - 275.0}&{250.6}&{1.83}\\8&{ - 231.1}&{ - 275.7}&{44.6}&{ - 0.17}\\9&{ - 186.8}&{ - 214.6}&{27.8}&{ - 0.52}\\{10}&{ - 58.5}&{ - 117.8}&{59.3}&{0.00}\\{11}&{ - 219.3}&{ - 326.7}&{107.4}&{0.17}\\{12}&{ - 113.1}&{ - 272.9}&{159.8}&{0.73}\\{13}&{ - 244.3}&{ - 429.1}&{184.8}&{1.28}\\{14}&{ - 184.4}&{ - 140.6}&{ - 43.8}&{ - 1.83}\\{15}&{ - 199.2}&{ - 345.6}&{146.4}&{0.52}\\{}&{}&{}&{}&{}\end{aligned}\)

a. Is it plausible that the differences came from a normally distributed population?

b. The article reported that mean\(( \pm SD) = - 145.3(68.0)\)for ER velocity and\( = - 227.8(96.6)\)for IR velocity. Based just on this information, could a test of hypotheses about the difference between true average IR velocity and true average ER velocity be carried out? Explain.

c. The article stated that " The lead hip peak IR velocity was significantly greater than the trail hip ER velocity\(\;\left( {p = 0.003, t value = 3.65} \right)\). "$ (The phrasing suggests that an upper-tailed test was used.) Is that in fact the case? (Note: "\(p = .033 \)in Table 2 of the article is erroneous.)

Many freeways have service (or logo) signs that give information on attractions, camping, lodging, food, and gas services prior to off-ramps. These signs typically do not provide information on distances. The article "Evaluation of Adding Distance Information to Freeway-Specific Service (Logo) Signs" \((J\). of Transp. Engr., 2011: 782-788) reported that in one investigation, six sites along Virginia interstate highways where service signs are posted were selected. For each site, crash data was obtained for a three-year period before distance information was added to the service signs and for a one-year period afterward. The number of crashes per year before and after the sign changes were as follows:

\(\begin{array}{*{20}{l}}{Before:\;\;15\;26\;66\;115\;62\;64}\\{After:\;\;\;\;\;16\;24\;42\;80\;\;78\;73}\end{array}\)

a. The cited article included the statement "A paired\(t\)test was performed to determine whether there was any change in the mean number of crashes before and after the addition of distance information on the signs." Carry out such a test. (Note: The relevant normal probability plot shows a substantial linear pattern.)

b. If a seventh site were to be randomly selected among locations bearing service signs, between what values would you predict the difference in number of crashes to lie?

It is well known that a placebo, a fake medication or treatment, can sometimes have a positive effect just because patients often expect the medication or treatment to be helpful. The article "Beware the Nocebo Effect" (New York Times, Aug. 12, 2012) gave examples of a less familiar phenomenon, the tendency for patients informed of possible side effects to actually experience those side effects. The article cited a study reported in The Journal of Sexual Medicine in which a group of patients diagnosed with benign prostatic hyperplasia was randomly divided into two subgroups. One subgroup of size 55 received a compound of proven efficacy along with counseling that a potential side effect of the treatment was erectile dysfunction. The other subgroup of size 52 was given the same treatment without counseling. The percentage of the no-counseling subgroup that reported one or more sexual side effects was 15.3 %, whereas 43.6 % of the counseling subgroup reported at least one sexual side effect. State and test the appropriate hypotheses at significance level .05 to decide whether the nocebo effect is operating here. (Note: The estimated expected number of "successes" in the no-counseling sample is a bit shy of 10, but not by enough to be of great concern (some sources use a less conservative cutoff of 5 rather than 10).)

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free