
Use the data in Table 10.6 on page 640. We are interested in the bias of the sample median as an estimator of the median of the distribution.

a. Use the non-parametric bootstrap to estimate this bias.

b. How many bootstrap samples does it appear that you need in order to estimate the bias to within .05 with a probability of 0.99?

Short Answer


a. The estimate of the bias is approximately -1.615.

b. Around \(n = 39096\) bootstrap samples.

Step by step solution

01

(a) Using the non-parametric bootstrap to estimate the bias

Since no assumptions are made about the distribution F, the non-parametric bootstrap is used. The quantity to be estimated is the bias of the sample median.

To initialize the process, compute the sample median of the original sample. Then, for each bootstrap sample drawn with replacement from the data, calculate the difference between that sample's median and the median of the original sample. The bootstrap bias estimate is the average of these differences. The R code below implements this in RStudio; note that the variable of interest is sulfur dioxide.

The sample median of the original sample is 26. The approximation of the bias, i.e., the estimated bias, is -1.615. The number of bootstrap samples used was \(v = 25000\). Note that the bias estimate will change slightly every time the code is run.
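As a minimal sketch of the same procedure outside RStudio, the steps above can be written in Python. The data here are placeholder values, not the actual sulfur dioxide column of Table 10.6, and the function name is illustrative:

```python
import random
import statistics

def bootstrap_bias_of_median(sample, num_resamples=2000, seed=0):
    # Nonparametric bootstrap: resample with replacement, take each
    # resample's median, and average (resample median - original median).
    rng = random.Random(seed)
    original_median = statistics.median(sample)
    total = 0.0
    for _ in range(num_resamples):
        resample = rng.choices(sample, k=len(sample))
        total += statistics.median(resample) - original_median
    return total / num_resamples

# Placeholder data only -- NOT the sulfur dioxide values of Table 10.6.
toy_sample = [13, 18, 21, 24, 26, 26, 30, 35, 41, 53]
bias_estimate = bootstrap_bias_of_median(toy_sample)
```

With the actual Table 10.6 column in place of `toy_sample`, this mirrors the R loop in the solution.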

02

(b) Determining how many bootstrap samples are needed

For the bias estimate to fall within 0.05 of the actual bias, one should examine the following probability, where \(\theta \) denotes the error of the simulation estimate:

\(P( - 0.05 < \theta < 0.05) = 0.99\)

The exercise requires this probability to be at least 0.99. Evaluating it exactly would require the estimator's expected value and standard deviation. Instead, since the number of samples is large, use the simulated sample standard deviation as an estimate (specifically, the sample standard deviation of the n generated sample medians) and set the mean equal to zero. Then,

\(\begin{aligned}P( - 0.05 < \theta < 0.05) &= P\left( {\frac{{ - 0.05 - 0}}{{\sqrt {4.25/n} }} < \frac{{\theta - 0}}{{\sqrt {4.25/n} }} < \frac{{0.05 - 0}}{{\sqrt {4.25/n} }}} \right)\\ &= P\left( {\frac{{ - 0.05}}{{\sqrt {4.25/n} }} < Z < \frac{{0.05}}{{\sqrt {4.25/n} }}} \right)\\ &= 0.99,\end{aligned}\)

or, equivalently, because \(P(Z < 2.576) = 0.995\) and the standard normal distribution is symmetric, it must hold that

\(\frac{{0.05}}{{\sqrt {4.25/n} }} = 2.576\)

which leads to

\(n = {\left( {\frac{{4.25}}{{0.04}} \times 2.576} \right)^2} = 39096\)

Instead of the initial 25000 samples, use \(n \ge 39096\) to obtain the desired result. By rerunning the code with the changed value of n, the standard error of the simulation is close to \(0.021 < 0.05\), and the estimate of the bias is close to -1.679.

#The library for built-in bootstrap functions
library(boot)

#read data (the file path is truncated in the source)
data = read.table(".../Users/.../Exercise5.txt")

#a column instead of a list
SulfurDioxide = matrix(unlist(data[1]))

#the sample median of the original sample
OriginalSampleMedian = median(SulfurDioxide)

#the number of bootstrap samples
n = 39096
XstariMedian = numeric(n)
SampleBias = numeric(n)

#generate the bootstrap samples X*_i and the biases
for (i in 1:n) {
  #generate the bootstrap sample with replacement
  Xstari = sample(SulfurDioxide, length(SulfurDioxide), replace = T)
  #median of the bootstrap sample
  XstariMedian[i] = median(Xstari)
  #bias
  SampleBias[i] = XstariMedian[i] - OriginalSampleMedian
}

#average the differences: the bootstrap bias estimate
BootstrapEstimateMedian = mean(SampleBias)

#the second part: the standard error of the simulation
BootstrapEstimateSDMedian = sd(SampleBias)/sqrt(n)

#the estimate of the standard deviation of the n sample medians
BootstrapEstimateSD = sd(XstariMedian)
BootstrapEstimateMean = mean(XstariMedian)

#the number of bootstrap samples needed for part (b)
n = (qnorm(0.995, 0, 1) * BootstrapEstimateSD / 0.05)^2
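The last step of the script inverts the normal tail bound to get a required sample count: \(n = (z\hat \sigma /\varepsilon )^2\), with \(z\) the appropriate normal quantile. This rule can be sketched generically; the following Python helper and its example numbers are illustrative, not taken from the solution:

```python
import math
from statistics import NormalDist

def required_bootstrap_samples(sd_of_medians, tolerance, prob=0.99):
    # Smallest n with P(|error of the simulation average| < tolerance) >= prob,
    # assuming the simulation average is approximately normal with
    # standard deviation sd_of_medians / sqrt(n).
    z = NormalDist().inv_cdf((1 + prob) / 2)  # e.g. 2.576 for prob = 0.99
    return math.ceil((z * sd_of_medians / tolerance) ** 2)

# Hypothetical example: simulated standard deviation 2.0, tolerance 0.1.
n_needed = required_bootstrap_samples(2.0, 0.1)  # -> 2654
```

Loosening the confidence level shrinks the requirement, since the quantile \(z\) enters squared.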


Most popular questions from this chapter

Suppose that we wish to approximate the integral \(\int g (x)\,dx\). Suppose that we have a p.d.f. \(f\) that we shall use as an importance function. Suppose that \(g(x)/f(x)\) is bounded. Prove that the importance sampling estimator has finite variance.

Let \({x_1}, \ldots ,{x_n}\) be the observed values of a random sample \(X = \left( {{X_1}, \ldots ,{X_n}} \right)\). Let \({F_n}\) be the sample c.d.f. Let \({J_1}, \ldots ,{J_n}\) be a random sample with replacement from the numbers \(\left\{ {1, \ldots ,n} \right\}\). Define \(X_i^* = {x_{{J_i}}}\) for \(i = 1, \ldots ,n\). Show that \({X^*} = \left( {X_1^*, \ldots ,X_n^*} \right)\) is an i.i.d. sample from the distribution \({F_n}\).

In Sec. 10.2, we discussed \({\chi ^2}\) goodness-of-fit tests for composite hypotheses. These tests required computing M.L.E.'s based on the numbers of observations that fell into the different intervals used for the test. Suppose instead that we use the M.L.E.'s based on the original observations. In this case, we claimed that the asymptotic distribution of the \({\chi ^2}\) test statistic was somewhere between two different \({\chi ^2}\) distributions. We can use simulation to better approximate the distribution of the test statistic. In this exercise, assume that we are trying to test the same hypotheses as in Example 10.2.5, although the methods will apply in all such cases.

a. Simulate \(v = 1000\) samples of size \(n = 23\) from each of 10 different normal distributions. Let the normal distributions have means of \(3.8\), \(3.9\), \(4.0\), \(4.1\), and \(4.2\). Let the distributions have variances of 0.25 and 0.8. Use all 10 combinations of mean and variance. For each simulated sample, compute the \({\chi ^2}\) statistic \(Q\) using the usual M.L.E.'s of \(\mu \) and \({\sigma ^2}\). For each of the 10 normal distributions, estimate the 0.9, 0.95, and 0.99 quantiles of the distribution of \(Q\).

b. Do the quantiles change much as the distribution of the data changes?

c. Consider the test that rejects the null hypothesis if \(Q \ge 5.2.\) Use simulation to estimate the power function of this test at the following alternative: For each \(i,\left( {{X_i} - 3.912} \right)/0.5\) has the t distribution with five degrees of freedom.

In Example 12.5.6, we modeled the parameters \({\tau _1}, \ldots ,{\tau _p}\) as i.i.d. having the gamma distribution with parameters \({\alpha _0}\) and \({\beta _0}\). We could have added a level to the hierarchical model that would allow the \({\tau _i}\)'s to come from a distribution with an unknown parameter. For example, suppose that we model the \({\tau _i}\)'s as conditionally independent, having the gamma distribution with parameters \({\alpha _0}\) and \(\beta \) given \(\beta \). Let \(\beta \) be independent of \(\psi \) and \({\mu _1}, \ldots ,{\mu _p}\), with \(\beta \) having the prior distribution as specified in Example 12.5.6.

a. Write the product of the likelihood and the prior as a function of the parameters \({\mu _1}, \ldots ,{\mu _p}\), \({\tau _1}, \ldots ,{\tau _p}\), \(\psi \), and \(\beta \).

b. Find the conditional distributions of each parameter given all of the others. Hint: For all the parameters besides \(\beta \), the distributions should be almost identical to those given in Example 12.5.6. Wherever \({\beta _0}\) appears, of course, something will have to change.

c. Use a prior distribution in which \({\psi _0} = 170\). Fit the model to the hot dog calorie data from Example 11.6.2. Compute the posterior means of the four \({\mu _i}\)'s and \(1/{\tau _i}\)'s.

In Example 12.5.6, we used a hierarchical model. In that model, the parameters \({\mu _1}, \ldots ,{\mu _p}\) were independent random variables with \({\mu _i}\) having the normal distribution with mean \(\psi \) and precision \({\lambda _0}{\tau _i}\), conditional on \(\psi \) and \({\tau _1}, \ldots ,{\tau _p}\). To make the model more general, we could also replace \({\lambda _0}\) with an unknown parameter \(\lambda \). That is, let the \({\mu _i}\)'s be independent, with \({\mu _i}\) having the normal distribution with mean \(\psi \) and precision \(\lambda {\tau _i}\), conditional on \(\psi \), \(\lambda \), and \({\tau _1}, \ldots ,{\tau _p}\). Let \(\lambda \) have the gamma distribution with parameters \({\gamma _0}\) and \({\delta _0}\), and let \(\lambda \) be independent of \(\psi \) and \({\tau _1}, \ldots ,{\tau _p}\). The remaining parameters have the prior distributions stated in Example 12.5.6.

a. Write the product of the likelihood and the prior as a function of the parameters \({\mu _1}, \ldots ,{\mu _p}\), \({\tau _1}, \ldots ,{\tau _p}\), \(\psi \), and \(\lambda \).

b. Find the conditional distributions of each parameter given all of the others. Hint: For all the parameters besides \(\lambda \), the distributions should be almost identical to those given in Example 12.5.6. Wherever \({\lambda _0}\) appears, of course, something will have to change.

c. Use a prior distribution in which \({\alpha _0} = 1\), \({\beta _0} = 0.1\), \({u_0} = 0.001\), \({\gamma _0} = {\delta _0} = 1\), and \({\psi _0} = 170\). Fit the model to the hot dog calorie data from Example 11.6.2. Compute the posterior means of the four \({\mu _i}\)'s and \(1/{\tau _i}\)'s.
