Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Consider a sample \({x_1},{x_2},...,{x_n}\) with neven. Let \({\bar x_L}\) and \({\bar x_U}\) denote the average of the smallest n/2 and the largest n/2 observations, respectively. Show that the mean absolute deviation from the median for this sample satisfies

\(\sum {\left| {{x_i} - \tilde x} \right|} /n = \left( {{{\bar x}_U} - {{\bar x}_L}} \right)/2\)

Then show that if n is odd and the two averages are calculated after excluding the median from each half, replacing non the left with n-1 gives the correct result.(Hint: Break the sum into two parts, the first involving observations less than or equal to the median and the second involving observations greater than or equal to the median.)

Short Answer

Expert verified

\(\frac{{\sum\limits_{i = 1}^n {\left| {{X_i} - \tilde X} \right|} }}{n} = \frac{{\left( {{{\bar X}_U} - {{\bar X}_L}} \right)}}{2}\)

Step by step solution

01

Given information

A sample of size n is \({x_1},{x_2},...,{x_n}\)with n even.

\({\bar x_L}\) and \({\bar x_U}\) denote the average of the smallest n/2 and the largest n/2 observations, respectively.

02

Derivation of the proof

Let\(n = 2d\left( {even} \right)\).

The ascending of n observations is,

\({X_1},{X_2},...,{X_d},{X_{d + 1}},...,{X_{2d}}\).

The sample mean is given as,

\(\begin{aligned}{{\bar X}_L} &= \frac{1}{d}\sum\limits_{i = 1}^d {{X_i}} \\{{\bar X}_U} &= \frac{1}{d}\sum\limits_{i = 1}^d {{X_{d + i}}} \end{aligned}\)

The sample median is given as,

\(\tilde X = {X_{d + 1}}\)

Further calculations are computed as,

\(\begin{aligned}\frac{{n\left( {{{\bar X}_U} - {{\bar X}_L}} \right)}}{2} &= d\left( {{{\bar X}_U} - {{\bar X}_L}} \right)\\ &= \sum\limits_{i = 1}^d {{X_{d + i}}} - \sum\limits_{i = 1}^d {{X_i}} \\ &= \sum\limits_{i = 1}^d {\left( {{X_{d + i}} - {X_i}} \right)} \\ &= \sum\limits_{i = 1}^d {\left( {{X_{d + i}} - \tilde X + \tilde X - {X_i}} \right)} \,\;\;\;\;\;\left( {Adding\;and\;substracting\;median} \right)\\ &= \sum\limits_{i = 1}^d {\left| {{X_{d + i}} - \tilde X} \right|} + \sum\limits_{i = 1}^d {\left| {\tilde X - {X_i}} \right|} \\ &= \sum\limits_{i = 1}^{2d} {\left| {{X_{d + i}} - \tilde X} \right|} + \sum\limits_{i = 1}^d {\left| {{X_i} - \tilde X} \right|} \\ &= \sum\limits_{i = 1}^{2d} {\left| {{X_i} - \tilde X} \right|} \end{aligned}\)

Therefore,

\(\begin{aligned}\frac{{n\left( {{{\bar X}_U} - {{\bar X}_L}} \right)}}{2} = \sum\limits_{i = 1}^{2d} {\left| {{X_i} - \tilde X} \right|} \\ = \sum\limits_{i = 1}^n {\left| {{X_i} - \tilde X} \right|} \end{aligned}\)

Thus,

\(\frac{{\sum\limits_{i = 1}^n {\left| {{X_i} - \tilde X} \right|} }}{n} = \frac{{\left( {{{\bar X}_U} - {{\bar X}_L}} \right)}}{2}\)

For odd number of observations, \(n = 2d + 1\).

The ascending of n observations is,

\({X_1},{X_2},...,{X_d},{X_{d + 1}},...,{X_{2d + 1}}\).

The sample mean is given as,

\(\begin{aligned}{{\bar X}_L} &= \frac{1}{d}\sum\limits_{i = 1}^d {{X_i}} \\{{\bar X}_U} &= \frac{1}{d}\sum\limits_{i = 1}^d {{X_{d + 1 + i}}} \end{aligned}\)

The sample median is given as,

\(\tilde X = {X_{d + 1}}\)

The calculations are as follows,

\(\begin{aligned}\frac{{\left( {n - 1} \right) * \left( {{{\bar X}_U} - {{\bar X}_L}} \right)}}{2} &= d\left( {{{\bar X}_U} - {{\bar X}_L}} \right)\\ &= \sum\limits_{i = 1}^d {{X_{d + 1 + i}}} - \sum\limits_{i = 1}^d {{X_i}} \\ &= \sum\limits_{i = 1}^d {\left( {{X_{d + 1 + i}} - {X_i}} \right)} \\ &= \sum\limits_{i = 1}^d {\left( {{X_{d + 1 + i}} - \tilde X + \tilde X - {X_i}} \right)} \\ &= \sum\limits_{i = 1}^d {\left| {{X_{d + 1 + i}} - \tilde X} \right| + \sum\limits_{I = 1}^d {\left| {\tilde X - {X_i}} \right|} } \\ &= \sum\limits_{i = d + 2}^{2d + 1} {\left| {{X_i} - \tilde X} \right| + \sum\limits_{I = 1}^d {\left| {{X_i} - \tilde X} \right|} } \\ &= \sum\limits_{i = 1}^d {\left| {{X_i} - \tilde X} \right| + \left| {\tilde X - \tilde X} \right|} + \sum\limits_{i = d + 2}^{2d + 1} {\left| {{X_i} - \tilde X} \right|} \\ &= \sum\limits_{i = 1}^{2d + 1} {\left| {{X_i} - \tilde X} \right|} \end{aligned}\)

Therefore,

\(\begin{aligned}\frac{{\left( {n - 1} \right) * \left( {{{\bar X}_U} - {{\bar X}_L}} \right)}}{2} &= \sum\limits_{i = 1}^{2d + 1} {\left| {{X_i} - \tilde X} \right|} \\ &= \sum\limits_{i = 1}^n {\left| {{X_i} - \tilde X} \right|} \\ &= \frac{{\sum\limits_{i = 1}^n {\left| {{X_i} - \tilde X} \right|} }}{{n - 1}}\\ &= \frac{{{{\bar X}_U} - {{\bar X}_L}}}{2}\end{aligned}\)

Hence Proved.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Consider the following information on ultimate tensile strength (lb/in) for a sample of n = 4 hard zirconium copper wire specimens (from “Characterization Methods forFine Copper Wire,” Wire J. Intl., Aug., 1997: 74–80):mean = 76,831 s = 180 smallest = 76,683 largest = 77,048

Determine the values of the two middle sample observations (and don’t do it by successive guessing!)

Each customer making a particular Internet purchase must pay with one of three types of credit cards (think Visa, MasterCard, AmEx). Let \({{\rm{A}}_{\rm{i}}}{\rm{(i = 1,2,3)}}\) be the event that a type \({\rm{i}}\)credit card is used, with\({\rm{P}}\left( {{{\rm{A}}_{\rm{1}}}} \right){\rm{ = }}{\rm{.5}}\), \({\rm{P}}\left( {{{\rm{A}}_{\rm{2}}}} \right){\rm{ = }}{\rm{.3}}\), and\({\rm{P}}\left( {{{\rm{A}}_{\rm{3}}}} \right){\rm{ = }}{\rm{.2}}\). Suppose that the number of customers who make such a purchase on a given day is a Poisson \({\rm{rv}}\) with parameter\({\rm{\lambda }}\). Define \({\rm{rv}}\) 's \({{\rm{X}}_{{\rm{1,}}}}{{\rm{X}}_{\rm{2}}}{\rm{,}}{{\rm{X}}_{\rm{3}}}\)by \({{\rm{X}}_{\rm{i}}}{\rm{ = }}\)the number among the \({\rm{N}}\)customers who use a type \({\rm{i}}\)card\({\rm{(i = 1,2,3)}}\). Show that these three \({\rm{rv}}\) 's are independent with Poisson distributions having parameters\({\rm{.5\lambda ,}}{\rm{.3\lambda }}\), and\({\rm{.2\lambda }}\), respectively. (Hint: For non-negative integers\({{\rm{x}}_{\rm{1}}}{\rm{,}}{{\rm{x}}_{\rm{2}}}{\rm{,}}{{\rm{x}}_{\rm{3}}}\), let\({\rm{n = }}{{\rm{x}}_{\rm{1}}}{\rm{ + }}{{\rm{x}}_{\rm{2}}}{\rm{ + }}{{\rm{x}}_{\rm{3}}}\). Then\({\rm{P}}\left( {{{\rm{X}}_{\rm{1}}}{\rm{ = }}{{\rm{x}}_{\rm{1}}}} \right.\), \(\left. {{{\rm{X}}_{\rm{2}}}{\rm{ = }}{{\rm{x}}_{\rm{2}}}{\rm{,}}{{\rm{X}}_{\rm{3}}}{\rm{ = }}{{\rm{x}}_{\rm{3}}}} \right){\rm{ = P}}\left( {{{\rm{X}}_{\rm{1}}}{\rm{ = }}{{\rm{x}}_{\rm{1}}}{\rm{,}}{{\rm{X}}_{\rm{2}}}{\rm{ = }}{{\rm{x}}_{\rm{2}}}{\rm{,}}{{\rm{X}}_{\rm{3}}}{\rm{ = }}{{\rm{x}}_{\rm{3}}}{\rm{,N = n}}} \right)\) (why is this?). Now condition on\({\rm{N = n}}\), in which case the three \({\rm{Xi}}\) 's have a trinomial distribution (multinomial with three categories) with category probabilities\({\rm{.5,}}{\rm{.3}}\), and . \({\rm{2}}\).)

Allowable mechanical properties for structural design of metallic aerospace vehicles requires an approved method for statistically analyzing empirical test data. The article “Establishing Mechanical Property Allowables for Metals” (J. of Testing and Evaluation, 1998: 293–299) used the accompanying data on tensile ultimate strength (ksi) as a basis for addressing the difficulties in developing such a method.

122.2 124.2 124.3 125.6 126.3 126.5 126.5 127.2 127.3

127.5 127.9 128.6 128.8 129.0 129.2 129.4 129.6 130.2

130.4 130.8 131.3 131.4 131.4 131.5 131.6 131.6 131.8

131.8 132.3 132.4 132.4 132.5 132.5 132.5 132.5 132.6

132.7 132.9 133.0 133.1 133.1 133.1 133.1 133.2 133.2

133.2 133.3 133.3 133.5 133.5 133.5 133.8 133.9 134.0

134.0 134.0 134.0 134.1 134.2 134.3 134.4 134.4 134.6

134.7 134.7 134.7 134.8 134.8 134.8 134.9 134.9 135.2

135.2 135.2 135.3 135.3 135.4 135.5 135.5 135.6 135.6

135.7 135.8 135.8 135.8 135.8 135.8 135.9 135.9 135.9

135.9 136.0 136.0 136.1 136.2 136.2 136.3 136.4 136.4

136.6 136.8 136.9 136.9 137.0 137.1 137.2 137.6 137.6

137.8 137.8 137.8 137.9 137.9 138.2 138.2 138.3 138.3

138.4 138.4 138.4 138.5 138.5 138.6 138.7 138.7 139.0

139.1 139.5 139.6 139.8 139.8 140.0 140.0 140.7 140.7

140.9 140.9 141.2 141.4 141.5 141.6 142.9 143.4 143.5

143.6 143.8 143.8 143.9 144.1 144.5 144.5 147.7 147.7

a. Construct a stem-and-leaf display of the data by first deleting (truncating) the tenths digit and then repeating each stem value five times (once for leaves 1 and 2, a second time for leaves 3 and 4, etc.). Why is it relatively easy to identify a representative strength value?

b. Construct a histogram using equal-width classes with the first class having a lower limit of 122 and an upper limit of 124. Then comment on any interesting features of the histogram.

Let \({\rm{X}}\) be the total medical expenses (in \({\rm{1000}}\) s of dollars) incurred by a particular individual during a given year. Although \({\rm{X}}\) is a discrete random variable, suppose its distribution is quite well approximated by a continuous distribution with pdf \({\rm{f(x) = k(1 + x/2}}{\rm{.5}}{{\rm{)}}^{{\rm{ - 7}}}}\) for.

a. What is the value of\({\rm{k}}\)?

b. Graph the pdf of \({\rm{X}}\).

c. What are the expected value and standard deviation of total medical expenses?

d. This individual is covered by an insurance plan that entails a \({\rm{\$ 500}}\) deductible provision (so the first \({\rm{\$ 500}}\) worth of expenses are paid by the individual). Then the plan will pay \({\rm{80\% }}\) of any additional expenses exceeding \({\rm{\$ 500}}\), and the maximum payment by the individual (including the deductible amount) is\({\rm{\$ 2500}}\). Let \({\rm{Y}}\) denote the amount of this individual's medical expenses paid by the insurance company. What is the expected value of\({\rm{Y}}\)?

(Hint: First figure out what value of \({\rm{X}}\) corresponds to the maximum out-of-pocket expense of \({\rm{\$ 2500}}\). Then write an expression for \({\rm{Y}}\) as a function of \({\rm{X}}\) (which involves several different pieces) and calculate the expected value of this function.)

The National Health and Nutrition Examination Survey (NHANES) collects demographic, socioeconomic, dietary, and health related information on an annual basis. Here is a sample of \({\rm{20}}\) observations on HDL cholesterol level \({\rm{(mg/dl)}}\) obtained from the \({\rm{2009 - 2010}}\) survey (HDL is “good” cholesterol; the higher its value, the lower the risk for heart disease):

\(\begin{array}{l}{\rm{35 49 52 54 65 51 51}}\\{\rm{47 86 36 46 33 39 45}}\\{\rm{39 63 95 35 30 48}}\end{array}\)

a. Calculate a point estimate of the population mean HDL cholesterol level.

b. Making no assumptions about the shape of the population distribution, calculate a point estimate of the value that separates the largest \({\rm{50\% }}\) of HDL levels from the smallest \({\rm{50\% }}\).

c. Calculate a point estimate of the population standard deviation.

d. An HDL level of at least \({\rm{60}}\) is considered desirable as it corresponds to a significantly lower risk of heart disease. Making no assumptions about the shape of the population distribution, estimate the proportion \({\rm{p}}\) of the population having an HDL level of at least \({\rm{60}}\).

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free