
Often in regression the mean of the random variable \(Y\) is a linear function of \(p\) values \(x_{1}, x_{2}, \ldots, x_{p}\), say \(\beta_{1} x_{1}+\beta_{2} x_{2}+\cdots+\beta_{p} x_{p}\), where \(\boldsymbol{\beta}^{\prime}=\left(\beta_{1}, \beta_{2}, \ldots, \beta_{p}\right)\) are the regression coefficients. Suppose that \(n\) values, \(\boldsymbol{Y}^{\prime}=\left(Y_{1}, Y_{2}, \ldots, Y_{n}\right)\), are observed for the \(x\)-values in \(\boldsymbol{X}=\left[x_{ij}\right]\), where \(\boldsymbol{X}\) is an \(n \times p\) design matrix whose \(i\)th row is associated with \(Y_{i}\), \(i=1,2, \ldots, n\). Assume that \(\boldsymbol{Y}\) is multivariate normal with mean \(\boldsymbol{X} \boldsymbol{\beta}\) and variance-covariance matrix \(\sigma^{2} \boldsymbol{I}\), where \(\boldsymbol{I}\) is the \(n \times n\) identity matrix.

(a) Note that \(Y_{1}, Y_{2}, \ldots, Y_{n}\) are independent. Why?

(b) Since \(\boldsymbol{Y}\) should approximately equal its mean \(\boldsymbol{X} \boldsymbol{\beta}\), we estimate \(\boldsymbol{\beta}\) by solving the normal equations \(\boldsymbol{X}^{\prime} \boldsymbol{Y}=\boldsymbol{X}^{\prime} \boldsymbol{X} \boldsymbol{\beta}\) for \(\boldsymbol{\beta}\). Assuming that \(\boldsymbol{X}^{\prime} \boldsymbol{X}\) is nonsingular, solve the equations to get \(\hat{\boldsymbol{\beta}}=\left(\boldsymbol{X}^{\prime} \boldsymbol{X}\right)^{-1} \boldsymbol{X}^{\prime} \boldsymbol{Y}\). Show that \(\hat{\boldsymbol{\beta}}\) has a multivariate normal distribution with mean \(\boldsymbol{\beta}\) and variance-covariance matrix $$ \sigma^{2}\left(\boldsymbol{X}^{\prime} \boldsymbol{X}\right)^{-1}. $$

(c) Show that $$ (\boldsymbol{Y}-\boldsymbol{X} \boldsymbol{\beta})^{\prime}(\boldsymbol{Y}-\boldsymbol{X} \boldsymbol{\beta})=(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})^{\prime}\left(\boldsymbol{X}^{\prime} \boldsymbol{X}\right)(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})+(\boldsymbol{Y}-\boldsymbol{X} \hat{\boldsymbol{\beta}})^{\prime}(\boldsymbol{Y}-\boldsymbol{X} \hat{\boldsymbol{\beta}}), $$ say \(Q=Q_{1}+Q_{2}\) for convenience.

(d) Show that \(Q_{1} / \sigma^{2}\) is \(\chi^{2}(p)\).

(e) Show that \(Q_{1}\) and \(Q_{2}\) are independent.

(f) Argue that \(Q_{2} / \sigma^{2}\) is \(\chi^{2}(n-p)\).

(g) Find \(c\) so that \(c Q_{1} / Q_{2}\) has an \(F\)-distribution.

(h) The fact that a value \(d\) can be found so that \(P\left(c Q_{1} / Q_{2} \leq d\right)=1-\alpha\) could be used to find a \(100(1-\alpha)\) percent confidence ellipsoid for \(\boldsymbol{\beta}\). Explain.

Short Answer

This exercise works through the distribution theory of multiple linear regression. Under the assumption that \(\boldsymbol{Y}\) is multivariate normal with mean \(\boldsymbol{X}\boldsymbol{\beta}\) and variance-covariance matrix \(\sigma^{2}\boldsymbol{I}\), we show that the observations are independent, derive the least squares estimator \(\hat{\boldsymbol{\beta}}\) and its multivariate normal distribution, decompose the sum of squares \(Q\) into independent pieces \(Q_{1}\) and \(Q_{2}\) with chi-square distributions, build an \(F\)-statistic from their ratio, and use it to construct a confidence ellipsoid for \(\boldsymbol{\beta}\).

Step by step solution

Step 1: Part (a)

Since the variance-covariance matrix of \(\boldsymbol{Y}\) is \(\sigma^{2} \boldsymbol{I}\), where \(\boldsymbol{I}\) is the identity matrix, all off-diagonal entries, which are the covariances \(\operatorname{Cov}(Y_{i}, Y_{j})\) for \(i \neq j\), are zero. Thus \(Y_{1}, Y_{2}, \ldots, Y_{n}\) are uncorrelated, and because they are jointly (multivariate) normal, uncorrelatedness implies independence.
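To see this directly, the joint density factors into a product of univariate normal densities, which is the definition of independence (writing \(\mu_{i}\) for the \(i\)th entry of \(\boldsymbol{X}\boldsymbol{\beta}\)):
$$ f(\boldsymbol{y})=\left(2\pi\sigma^{2}\right)^{-n/2}\exp\left[-\frac{(\boldsymbol{y}-\boldsymbol{X}\boldsymbol{\beta})^{\prime}(\boldsymbol{y}-\boldsymbol{X}\boldsymbol{\beta})}{2\sigma^{2}}\right]=\prod_{i=1}^{n}\frac{1}{\sqrt{2\pi}\,\sigma}\exp\left[-\frac{(y_{i}-\mu_{i})^{2}}{2\sigma^{2}}\right]. $$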
Step 2: Part (b)

Since \(\boldsymbol{X}^{\prime} \boldsymbol{X}\) is nonsingular, multiplying both sides of the normal equations \(\boldsymbol{X}^{\prime} \boldsymbol{Y}=\boldsymbol{X}^{\prime} \boldsymbol{X} \boldsymbol{\beta}\) by \(\left(\boldsymbol{X}^{\prime} \boldsymbol{X}\right)^{-1}\) gives the least squares estimator \(\hat{\boldsymbol{\beta}}=\left(\boldsymbol{X}^{\prime} \boldsymbol{X}\right)^{-1} \boldsymbol{X}^{\prime} \boldsymbol{Y}\). This is a linear transformation \(\boldsymbol{A}\boldsymbol{Y}\) of the multivariate normal vector \(\boldsymbol{Y}\), with \(\boldsymbol{A}=\left(\boldsymbol{X}^{\prime} \boldsymbol{X}\right)^{-1} \boldsymbol{X}^{\prime}\), so \(\hat{\boldsymbol{\beta}}\) is also multivariate normal, with mean \(\boldsymbol{A}\boldsymbol{X}\boldsymbol{\beta}=\left(\boldsymbol{X}^{\prime} \boldsymbol{X}\right)^{-1} \boldsymbol{X}^{\prime} \boldsymbol{X} \boldsymbol{\beta}=\boldsymbol{\beta}\) and variance-covariance matrix $$ \boldsymbol{A}\left(\sigma^{2}\boldsymbol{I}\right)\boldsymbol{A}^{\prime}=\sigma^{2}\left(\boldsymbol{X}^{\prime} \boldsymbol{X}\right)^{-1} \boldsymbol{X}^{\prime} \boldsymbol{X}\left(\boldsymbol{X}^{\prime} \boldsymbol{X}\right)^{-1}=\sigma^{2}\left(\boldsymbol{X}^{\prime} \boldsymbol{X}\right)^{-1}. $$
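As a sanity check, here is a minimal simulation sketch, assuming NumPy is available; the design matrix, coefficients, noise level, and sizes below are made-up illustrative values, not part of the exercise. It draws repeated samples of \(\boldsymbol{Y}\) and compares the empirical mean and covariance of \(\hat{\boldsymbol{\beta}}\) to \(\boldsymbol{\beta}\) and \(\sigma^{2}\left(\boldsymbol{X}^{\prime}\boldsymbol{X}\right)^{-1}\):

```python
import numpy as np

rng = np.random.default_rng(0)
n, p, sigma = 50, 3, 2.0                      # made-up sizes and noise level
X = rng.standard_normal((n, p))               # made-up fixed design matrix
beta = np.array([1.0, -2.0, 0.5])             # made-up true coefficients

# Draw many independent samples of Y ~ N(X beta, sigma^2 I), one per column.
reps = 20000
Y = (X @ beta)[:, None] + sigma * rng.standard_normal((n, reps))

# Solve the normal equations X'Y = X'X beta for every sample at once.
beta_hat = np.linalg.solve(X.T @ X, X.T @ Y)  # p x reps matrix of estimates

print(beta_hat.mean(axis=1))                  # should be close to beta
print(np.cov(beta_hat))                       # should be close to ...
print(sigma**2 * np.linalg.inv(X.T @ X))      # ... this theoretical matrix
```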
Step 3: Part (c)

Write \(\boldsymbol{Y}-\boldsymbol{X}\boldsymbol{\beta}=(\boldsymbol{Y}-\boldsymbol{X}\hat{\boldsymbol{\beta}})+\boldsymbol{X}(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})\) and expand the quadratic form using \((a+b)^{\prime}(a+b)=a^{\prime}a+2a^{\prime}b+b^{\prime}b\). The cross term vanishes because \(\boldsymbol{X}^{\prime}(\boldsymbol{Y}-\boldsymbol{X}\hat{\boldsymbol{\beta}})=\boldsymbol{X}^{\prime}\boldsymbol{Y}-\boldsymbol{X}^{\prime}\boldsymbol{X}\hat{\boldsymbol{\beta}}=\boldsymbol{0}\) by the normal equations, leaving exactly \(Q=Q_{1}+Q_{2}\).
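In full, the expansion reads
$$ \begin{aligned} (\boldsymbol{Y}-\boldsymbol{X}\boldsymbol{\beta})^{\prime}(\boldsymbol{Y}-\boldsymbol{X}\boldsymbol{\beta}) &=(\boldsymbol{Y}-\boldsymbol{X}\hat{\boldsymbol{\beta}})^{\prime}(\boldsymbol{Y}-\boldsymbol{X}\hat{\boldsymbol{\beta}})+2(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})^{\prime}\boldsymbol{X}^{\prime}(\boldsymbol{Y}-\boldsymbol{X}\hat{\boldsymbol{\beta}}) \\ &\quad+(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})^{\prime}\left(\boldsymbol{X}^{\prime}\boldsymbol{X}\right)(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})=Q_{2}+0+Q_{1}. \end{aligned} $$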
Step 4: Part (d)

By part (b), \(\hat{\boldsymbol{\beta}}\) is \(N_{p}\left(\boldsymbol{\beta}, \sigma^{2}\left(\boldsymbol{X}^{\prime}\boldsymbol{X}\right)^{-1}\right)\). Hence $$ Q_{1}/\sigma^{2}=(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})^{\prime}\left[\sigma^{2}\left(\boldsymbol{X}^{\prime}\boldsymbol{X}\right)^{-1}\right]^{-1}(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta}) $$ is a quadratic form in a multivariate normal vector with the inverse of its variance-covariance matrix in the middle. Such a form is \(\chi^{2}(p)\), with degrees of freedom equal to the dimension \(p\) of \(\hat{\boldsymbol{\beta}}\).
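Continuing the same hedged simulation setup (made-up \(\boldsymbol{X}\), \(\boldsymbol{\beta}\), and \(\sigma\); NumPy and SciPy assumed available), a Kolmogorov-Smirnov test of \(Q_{1}/\sigma^{2}\) against the \(\chi^{2}(p)\) cdf should not reject:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(1)
n, p, sigma = 50, 3, 2.0                      # made-up sizes and noise level
X = rng.standard_normal((n, p))               # made-up design matrix
beta = np.array([1.0, -2.0, 0.5])             # made-up true coefficients
XtX = X.T @ X

reps = 20000
Y = (X @ beta)[:, None] + sigma * rng.standard_normal((n, reps))
beta_hat = np.linalg.solve(XtX, X.T @ Y)
D = beta_hat - beta[:, None]

# Q1 = (beta_hat - beta)' X'X (beta_hat - beta), computed per replicate.
Q1 = np.einsum("ir,ij,jr->r", D, XtX, D)

# Compare Q1 / sigma^2 to the chi-square(p) distribution.
print(stats.kstest(Q1 / sigma**2, stats.chi2(df=p).cdf))  # expect a large p-value
```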
Step 5: Part (e)

\(Q_{1}\) is a function of \(\hat{\boldsymbol{\beta}}\) alone, and \(Q_{2}\) is a function of the residual vector \(\boldsymbol{e}=\boldsymbol{Y}-\boldsymbol{X}\hat{\boldsymbol{\beta}}=(\boldsymbol{I}-\boldsymbol{H})\boldsymbol{Y}\) alone, where \(\boldsymbol{H}=\boldsymbol{X}\left(\boldsymbol{X}^{\prime}\boldsymbol{X}\right)^{-1}\boldsymbol{X}^{\prime}\). Both are linear functions of \(\boldsymbol{Y}\), so they are jointly normal, and their cross-covariance is $$ \operatorname{Cov}(\hat{\boldsymbol{\beta}}, \boldsymbol{e})=\sigma^{2}\left(\boldsymbol{X}^{\prime}\boldsymbol{X}\right)^{-1}\boldsymbol{X}^{\prime}(\boldsymbol{I}-\boldsymbol{H})=\boldsymbol{0}, $$ since \(\boldsymbol{X}^{\prime}\boldsymbol{H}=\boldsymbol{X}^{\prime}\). For jointly normal vectors, zero covariance implies independence, so \(\hat{\boldsymbol{\beta}}\) and \(\boldsymbol{e}\) are independent, and therefore so are \(Q_{1}\) and \(Q_{2}\).
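The zero cross-covariance can be verified numerically; a minimal sketch, again with a made-up design matrix:

```python
import numpy as np

rng = np.random.default_rng(2)
n, p, sigma = 50, 3, 2.0                      # made-up sizes and noise level
X = rng.standard_normal((n, p))               # made-up design matrix

A = np.linalg.solve(X.T @ X, X.T)             # A = (X'X)^{-1} X', so beta_hat = A Y
H = X @ A                                     # hat matrix H = X (X'X)^{-1} X'

# Cov(beta_hat, e) = A (sigma^2 I)(I - H)' = sigma^2 A (I - H); should be 0.
cross = sigma**2 * A @ (np.eye(n) - H)
print(np.abs(cross).max())                    # ~ 0 up to floating-point error
```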
Step 6: Part (f)

Since \(Q/\sigma^{2}=\sum_{i=1}^{n}\left(Y_{i}-\mu_{i}\right)^{2}/\sigma^{2}\) is the sum of \(n\) squared independent standard normals, \(Q/\sigma^{2}\) is \(\chi^{2}(n)\). By part (c), \(Q=Q_{1}+Q_{2}\); by part (d), \(Q_{1}/\sigma^{2}\) is \(\chi^{2}(p)\); and by part (e), \(Q_{1}\) and \(Q_{2}\) are independent. The moment generating function of \(Q/\sigma^{2}\) therefore factors, which forces \(Q_{2}/\sigma^{2}\) to be \(\chi^{2}(n-p)\).
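Explicitly, for \(t<1/2\), independence gives
$$ (1-2t)^{-n/2}=E\left[e^{tQ/\sigma^{2}}\right]=E\left[e^{tQ_{1}/\sigma^{2}}\right]E\left[e^{tQ_{2}/\sigma^{2}}\right]=(1-2t)^{-p/2}\,E\left[e^{tQ_{2}/\sigma^{2}}\right], $$
so \(E\left[e^{tQ_{2}/\sigma^{2}}\right]=(1-2t)^{-(n-p)/2}\), which is the moment generating function of a \(\chi^{2}(n-p)\) distribution.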
Step 7: Part (g)

An \(F\)-distribution is the ratio of two independent chi-square random variables, each divided by its own degrees of freedom. With \(Q_{1}/\sigma^{2} \sim \chi^{2}(p)\) and \(Q_{2}/\sigma^{2} \sim \chi^{2}(n-p)\) independent, $$ \frac{Q_{1}/\left(\sigma^{2}p\right)}{Q_{2}/\left(\sigma^{2}(n-p)\right)}=\frac{n-p}{p}\cdot\frac{Q_{1}}{Q_{2}} $$ has an \(F(p, n-p)\) distribution. Hence \(c=(n-p)/p\).
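Under the same made-up simulation setup (NumPy and SciPy assumed available), a Kolmogorov-Smirnov test of \(cQ_{1}/Q_{2}\) with \(c=(n-p)/p\) against the \(F(p, n-p)\) cdf should not reject:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(3)
n, p, sigma = 50, 3, 2.0                      # made-up sizes and noise level
X = rng.standard_normal((n, p))               # made-up design matrix
beta = np.array([1.0, -2.0, 0.5])             # made-up true coefficients
XtX = X.T @ X

reps = 20000
Y = (X @ beta)[:, None] + sigma * rng.standard_normal((n, reps))
beta_hat = np.linalg.solve(XtX, X.T @ Y)
D = beta_hat - beta[:, None]
Q1 = np.einsum("ir,ij,jr->r", D, XtX, D)      # (bh - b)' X'X (bh - b) per replicate
Q2 = ((Y - X @ beta_hat) ** 2).sum(axis=0)    # residual sum of squares per replicate

c = (n - p) / p
print(stats.kstest(c * Q1 / Q2, stats.f(dfn=p, dfd=n - p).cdf))  # expect large p-value
```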
Step 8: Part (h)

With \(c=(n-p)/p\), the statistic \(cQ_{1}/Q_{2}\) has an \(F(p, n-p)\) distribution that does not depend on the unknown \(\boldsymbol{\beta}\) or \(\sigma^{2}\). Taking \(d\) to be the \((1-\alpha)\) quantile of that distribution gives \(P\left(cQ_{1}/Q_{2} \leq d\right)=1-\alpha\). Rewriting the event, $$ cQ_{1}/Q_{2} \leq d \quad \Longleftrightarrow \quad (\hat{\boldsymbol{\beta}}-\boldsymbol{\beta})^{\prime}\left(\boldsymbol{X}^{\prime}\boldsymbol{X}\right)(\hat{\boldsymbol{\beta}}-\boldsymbol{\beta}) \leq \frac{d}{c}\,Q_{2}. $$ Once the data are observed, \(\hat{\boldsymbol{\beta}}\) and \(Q_{2}\) are known numbers, so the set of all \(\boldsymbol{\beta}\) satisfying this inequality is an ellipsoid in \(p\)-dimensional space centered at \(\hat{\boldsymbol{\beta}}\). By construction it contains the true \(\boldsymbol{\beta}\) with probability \(1-\alpha\), so it is a \(100(1-\alpha)\%\) confidence ellipsoid for \(\boldsymbol{\beta}\).
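A simulation sketch of the coverage (same made-up setup; \(d\) taken as the \(1-\alpha\) quantile of \(F(p, n-p)\) via SciPy) illustrates the construction:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(4)
n, p, sigma, alpha = 50, 3, 2.0, 0.05         # made-up sizes, noise level, alpha
X = rng.standard_normal((n, p))               # made-up design matrix
beta = np.array([1.0, -2.0, 0.5])             # made-up true coefficients
XtX = X.T @ X

c = (n - p) / p
d = stats.f.ppf(1 - alpha, p, n - p)          # cutoff: P(c Q1/Q2 <= d) = 1 - alpha

reps = 20000
Y = (X @ beta)[:, None] + sigma * rng.standard_normal((n, reps))
beta_hat = np.linalg.solve(XtX, X.T @ Y)
D = beta_hat - beta[:, None]
Q1 = np.einsum("ir,ij,jr->r", D, XtX, D)
Q2 = ((Y - X @ beta_hat) ** 2).sum(axis=0)

# The ellipsoid {b : (bh - b)' X'X (bh - b) <= (d/c) Q2} covers the true beta
# exactly when Q1 <= (d/c) Q2; empirical coverage should be near 1 - alpha.
print(np.mean(Q1 <= (d / c) * Q2))            # ~ 0.95
```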


