Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Often in regression the mean of the random variable \(Y\) is a linear function of \(p\) -values \(x_{1}, x_{2}, \ldots, x_{p}\), say \(\beta_{1} x_{1}+\beta_{2} x_{2}+\cdots+\beta_{p} x_{p}\), where \(\beta^{\prime}=\left(\beta_{1}, \beta_{2}, \ldots, \beta_{p}\right)\) are the regression coefficients. Suppose that \(n\) values, \(\boldsymbol{Y}^{\prime}=\left(Y_{1}, Y_{2}, \ldots, Y_{n}\right)\), are observed for the \(x\) -values in \(\boldsymbol{X}=\left[x_{i j}\right]\), where \(\boldsymbol{X}\) is an \(n \times p\) design matrix and its ith row is associated with \(Y_{i}, i=1,2, \ldots, n\). Assume that \(\boldsymbol{Y}\) is multivariate normal with mean \(\boldsymbol{X} \boldsymbol{\beta}\) and variance-covariance matrix \(\sigma^{2} \boldsymbol{I}\), where \(\boldsymbol{I}\) is the \(n \times n\) identity matrix. (a) Note that \(Y_{1}, Y_{2}, \ldots, Y_{n}\) are independent. Why? (b) Since \(\boldsymbol{Y}\) should approximately equal its mean \(\boldsymbol{X} \boldsymbol{\beta}\), we estimate \(\boldsymbol{\beta}\) by solving the normal equations \(\boldsymbol{X}^{\prime} \boldsymbol{Y}=\boldsymbol{X}^{\prime} \boldsymbol{X} \boldsymbol{\beta}\) for \(\boldsymbol{\beta}\). Assuming that \(\boldsymbol{X}^{\prime} \boldsymbol{X}\) is non- singular, solve the equations to get \(\hat{\beta}=\left(\boldsymbol{X}^{\prime} \boldsymbol{X}\right)^{-1} \boldsymbol{X}^{\prime} \boldsymbol{Y}\). Show that \(\hat{\boldsymbol{\beta}}\) has a multivariate normal distribution with mean \(\boldsymbol{\beta}\) and variance- covariance matrix $$ \sigma^{2}\left(\boldsymbol{X}^{\prime} \boldsymbol{X}\right)^{-1} $$ (c) Show that $$ (\boldsymbol{Y}-\boldsymbol{X} \boldsymbol{\beta})^{\prime}(\boldsymbol{Y}-\boldsymbol{X} \boldsymbol{\beta})=(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})^{\prime}\left(\boldsymbol{X}^{\prime} \boldsymbol{X}\right)(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})+(\boldsymbol{Y}-\boldsymbol{X} \hat{\boldsymbol{\beta}})^{\prime}(\boldsymbol{Y}-\boldsymbol{X} \hat{\boldsymbol{\beta}}) $$ For the remainder of the exercise, let \(Q\) denote the quadratic form on the left side of this expression and \(Q_{1}\) and \(Q_{2}\) denote the respective quadratic forms on the right side. Hence, \(Q=Q_{1}+Q_{2}\). (d) Show that \(Q_{1} / \sigma^{2}\) is \(\chi^{2}(p)\). (e) Show that \(Q_{1}\) and \(Q_{2}\) are independent. (f) Argue that \(Q_{2} / \sigma^{2}\) is \(\chi^{2}(n-p)\). (g) Find \(c\) so that \(c Q_{1} / Q_{2}\) has an \(F\) -distribution. (h) The fact that a value \(d\) can be found so that \(P\left(c Q_{1} / Q_{2} \leq d\right)=1-\alpha\) could be used to find a \(100(1-\alpha) \%\) confidence ellipsoid for \(\boldsymbol{\beta} .\) Explain.

Short Answer

Expert verified
The values \(Y_{1}, Y_{2}, ..., Y_{n}\) are independent because the covariance matrix is a scalar multiple of the identity matrix. The estimator \(\hat{\beta}\) is derived as \((X^TX)^{-1}X^TY\) and follows a multivariate normal distribution. The given quadratic forms equation is proven through algebraic manipulations. \(Q_{1}\) and \(Q_{2}\), are independently distributed as \(\chi^2\) with \(p\) and \(n - p\) degrees of freedom respectively. A constant \(c\) is found such that \(cQ_1/Q_2\) has an F-distribution. Finally, a boundary value \(d\) is used to construct a 100(1 - \alpha)% confidence ellipsoid for \(\beta\).

Step by step solution

01

Reasoning about the independence of \(Y_{1}, Y_{2}, \ldots, Y_{n}\)

Examine the given scenario. It is stated that the vector \(Y\) is multivariate normal with mean \(X\beta\) and variance-covariance matrix \(\sigma^{2}I\), where \(I\) is the identity matrix. The diagonal elements of the covariance matrix represent the variances of the respective variables, and the off-diagonal elements represent the covariances. A covariance of zero between two variables indicates that they are statistically independent. Hence, since all off-diagonal elements of the matrix \(\sigma^{2}I\) are zero, all \(Y_{1}, Y_{2}, \ldots, Y_{n}\) are independent.
02

Solving for \(\hat{\beta}\) and its distribution

As given, solve the normal equations by setting \(X^TY = X^TX\beta\). From this, it can be derived that \(\hat{\beta} = (X^TX)^{-1}X^TY\), which is the ordinary least squares estimator for \(\beta\). Since \(\hat{\beta}\) is a linear transformation of the multivariate normally distributed variable \(Y\), it also follows a multivariate normal distribution with mean \(\beta\) and covariance matrix \(\sigma^{2}(X^TX)^{-1}\) by the properties of the multivariate normal distribution and linear transformations of it.
03

Proving the equality of quadratic forms

Define and prove the equality of quadratic forms through algebraic manipulations and the properties of transposes. Consider statements and properties such as \((A + B)^T = A^T + B^T\), \((AB)^T = B^TA^T\), \((A + B)^T(A + B) = A^TA + A^TB + B^TA + B^TB\), and \(X^TX\hat{\beta} = X^TY\). Combine these properties and statements to prove the provided equation.
04

Distributing chi-squared distributions to \(Q_{1}\) and \(Q_{2}\)

Consider that \(Q_1\) and \(Q_2\) are respectively chi-squared distributed with \(p\) and \(n - p\) degrees of freedom, because they are the sum of squared independent and normally distributed variables divided by their variances.
05

Discussing the independence of \(Q_{1}\) and \(Q_{2}\)

State that \(Q_1\) and \(Q_2\) are independent, as they are functions of disjoint sets of independent variables.
06

Applying the F-distribution

Recall the definition of the F-distribution which is the ratio of two independent chi-squared variables divided by their respective degrees of freedom. From this definition, find a constant \(c\) such that \(cQ_1/Q_2\) follows an F-distribution.
07

Interpreting confidence ellipsoid

Explain how determining a boundary value \(d\) such that the probability \(P(cQ_1/Q_2 \leq d) = 1 - \alpha\) can be used to define a 100(1 - \alpha)% confidence ellipsoid for \(\beta\). This probability indicates that, within the confidence ellipsoid, the true value of \(\beta\) can be found with a confidence level of 100(1 - \alpha)%, thus providing a range of plausible values for the regression parameters.

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Let \(X_{1}\) and \(X_{2}\) be two independent random variables. Let \(X_{1}\) and \(Y=\) \(X_{1}+X_{2}\) be \(\chi^{2}\left(r_{1}, \theta_{1}\right)\) and \(\chi^{2}(r, \theta)\), respectively. Here \(r_{1}

Using the notation of this section, assume that the means satisfy the condition that \(\mu=\mu_{1}+(b-1) d=\mu_{2}-d=\mu_{3}-d=\cdots=\mu_{b}-d .\) That is, the last \(b-1\) means are equal but differ from the first mean \(\mu_{1}\), provided that \(d \neq 0\). Let independent random samples of size \(a\) be taken from the \(b\) normal distributions with common unknown variance \(\sigma^{2}\). (a) Show that the maximum likelihood estimators of \(\mu\) and \(d\) are \(\hat{\mu}=\bar{X}_{. .}\) and $$ \hat{d}=\frac{\sum_{j=2}^{b} \bar{X}_{. j} /(b-1)-\bar{X}_{.1}}{b} $$ (b) Using Exercise \(9.2 .4\), find \(Q_{6}\) and \(Q_{7}=c \hat{d}^{2}\) so that, when \(d=0, Q_{7} / \sigma^{2}\) is \(\chi^{2}(1)\) and $$ \sum_{i=1}^{a} \sum_{j=1}^{b}\left(X_{i j}-\bar{X}_{. .}\right)^{2}=Q_{3}+Q_{6}+Q_{7} $$ (c) Argue that the three terms in the right-hand member of part (b), once divided by \(\sigma^{2}\), are independent random variables with chi-square distributions, provided that \(d=0\). (d) The ratio \(Q_{7} /\left(Q_{3}+Q_{6}\right)\) times what constant has an \(F\) -distribution, provided that \(d=0\) ? Note that this \(F\) is really the square of the two-sample \(T\) used to test the equality of the mean of the first distribution and the common mean of the other distributions, in which the last \(b-1\) samples are combined into one.

Under Model \((9.2 .1)\), show that the linear functions \(X_{i j}-\bar{X}_{. j}\) and \(\bar{X}_{. j}-\bar{X}_{\cdots}\) are uncorrelated. Hint: Recall the definition of \(\bar{X}_{. j}\) and \(\bar{X}_{. .}\) and, without loss of generality, we can let \(E\left(X_{i j}\right)=0\) for all \(i, j\).

Extend the Bonferroni procedure to simultaneous testing. That is, suppose we have \(m\) hypotheses of interest: \(H_{0 i}\) versus \(H_{1 i}, i=1, \ldots, m\). For testing \(H_{0 i}\) versus \(H_{1 i}\), let \(C_{i, \alpha}\) be a critical region of size \(\alpha\) and assume \(H_{0 i}\) is rejected if \(\mathbf{X}_{i} \in C_{i, \alpha}\), for a sample \(\mathbf{X}_{i} .\) Determine a rule so that we can simultaneously test these \(m\) hypotheses with a Type I error rate less than or equal to \(\alpha\).

Fit by the method of least squares the plane \(z=a+b x+c y\) to the five points \((x, y, z):(-1,-2,5),(0,-2,4),(0,0,4),(1,0,2),(2,1,0)\). Let the \(\mathrm{R}\) vectors \(\mathrm{x}, \mathrm{y}, \mathrm{z}\) contain the values for \(x, y\), and \(z .\) Then the \(\mathrm{LS}\) fit is computed by \(\operatorname{lm}(z-x+y)\)

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free