
Suppose that the population model determining \(y\) is $$ y=\beta_{0}+\beta_{1} x_{1}+\beta_{2} x_{2}+\beta_{3} x_{3}+u $$ and this model satisfies Assumptions MLR.1 through MLR.4. However, we estimate the model that omits \(x_{3}\). Let \(\tilde{\beta}_{0}, \tilde{\beta}_{1},\) and \(\tilde{\beta}_{2}\) be the OLS estimators from the regression of \(y\) on \(x_{1}\) and \(x_{2}\). Show that the expected value of \(\tilde{\beta}_{1}\) (given the values of the independent variables in the sample) is $$\mathbf{E}\left(\tilde{\beta}_{1}\right)=\beta_{1}+\beta_{3} \frac{\sum_{i=1}^{n} \hat{r}_{i 1} x_{i 3}}{\sum_{i=1}^{n} \hat{r}_{i 1}^{2}},$$ where the \(\hat{r}_{i 1}\) are the OLS residuals from the regression of \(x_{1}\) on \(x_{2}\). [Hint: The formula for \(\tilde{\beta}_{1}\) comes from equation (3.22). Plug \(y_{i}=\beta_{0}+\beta_{1} x_{i 1}+\beta_{2} x_{i 2}+\beta_{3} x_{i 3}+u_{i}\) into this equation. After some algebra, take the expectation treating \(x_{i 3}\) and \(\hat{r}_{i 1}\) as nonrandom.]

Short Answer

Expert verified
\(\mathbf{E}\left(\tilde{\beta}_{1}\right)=\beta_{1}+\beta_{3}\,\dfrac{\sum_{i=1}^{n} \hat{r}_{i 1} x_{i 3}}{\sum_{i=1}^{n} \hat{r}_{i 1}^{2}}\)

Step by step solution

01

Understand the Given Model

The given population model is: \[ y = \beta_{0} + \beta_{1} x_{1} + \beta_{2} x_{2} + \beta_{3} x_{3} + u \] It satisfies Assumptions MLR.1 through MLR.4. We estimate the model that omits \(x_3\), and we denote the OLS estimators from the regression of \(y\) on \(x_1\) and \(x_2\) by \(\tilde{\beta}_{0}, \tilde{\beta}_{1}, \tilde{\beta}_{2}\).
02

Express \(\tilde{\beta}_{1}\) in Partialled-Out Form

With \(x_3\) omitted, we regress \(y\) on \(x_1\) and \(x_2\) only. By equation (3.22), the resulting slope estimator on \(x_1\) can be written in partialled-out form as \[ \tilde{\beta}_{1} = \frac{\sum_{i=1}^{n} \hat{r}_{i1} y_{i}}{\sum_{i=1}^{n} \hat{r}_{i1}^{2}}, \] where the \(\hat{r}_{i1}\) are the OLS residuals from the regression of \(x_1\) on \(x_2\) (including an intercept).
03

Rewriting Based on the Full Model

Substitute \(y_i = \beta_{0} + \beta_{1} x_{i1} + \beta_{2} x_{i2} + \beta_{3} x_{i3} + u_i\) into the numerator of the formula for \(\tilde{\beta}_{1}\). Because the \(\hat{r}_{i1}\) are OLS residuals from regressing \(x_1\) on \(x_2\), they satisfy \(\sum_{i=1}^{n} \hat{r}_{i1} = 0\), \(\sum_{i=1}^{n} \hat{r}_{i1} x_{i2} = 0\), and \(\sum_{i=1}^{n} \hat{r}_{i1} x_{i1} = \sum_{i=1}^{n} \hat{r}_{i1}^{2}\). Using these properties, the terms involving \(\beta_{0}\) and \(\beta_{2}\) drop out, and \[ \tilde{\beta}_{1} = \beta_{1} + \beta_{3} \frac{\sum_{i=1}^{n} \hat{r}_{i1} x_{i3}}{\sum_{i=1}^{n} \hat{r}_{i1}^{2}} + \frac{\sum_{i=1}^{n} \hat{r}_{i1} u_{i}}{\sum_{i=1}^{n} \hat{r}_{i1}^{2}}. \]
04

Find Expected Value of \(\tilde{\beta}_{1}\)

Take the expectation conditional on the values of the independent variables in the sample, so that \(x_{i3}\) and \(\hat{r}_{i1}\) are treated as nonrandom. Under Assumption MLR.4, \(\mathbf{E}(u_i \mid x_{i1}, x_{i2}, x_{i3}) = 0\), so the last term has expectation zero and \[ \mathbf{E}(\tilde{\beta}_{1}) = \beta_{1} + \beta_{3} \frac{\sum_{i=1}^{n} \hat{r}_{i1} x_{i3}}{\sum_{i=1}^{n} \hat{r}_{i1}^{2}}, \] where the \(\hat{r}_{i1}\) are the OLS residuals obtained from the regression of \(x_1\) on \(x_2\). The second term is the omitted variable bias: it is zero only if \(\beta_{3} = 0\) or if \(x_3\) is uncorrelated in the sample with the part of \(x_1\) not explained by \(x_2\) (that is, \(\sum_{i=1}^{n} \hat{r}_{i1} x_{i3} = 0\)).
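The algebraic identity in Steps 03 and 04 can also be checked numerically. Below is a minimal simulation sketch (not part of the original exercise; the parameter values, data-generating choices, and use of numpy are illustrative assumptions): it generates one sample from the full model, runs the short regression of \(y\) on \(x_1\) and \(x_2\), and confirms that the slope on \(x_1\) equals \(\beta_1 + \beta_3 \sum \hat{r}_{i1} x_{i3} / \sum \hat{r}_{i1}^2 + \sum \hat{r}_{i1} u_i / \sum \hat{r}_{i1}^2\) in that sample.

```python
import numpy as np

# Numerical check of the partialled-out identity from Steps 03-04
# (illustrative simulated data; not taken from the textbook).
rng = np.random.default_rng(0)
n = 1_000
beta0, beta1, beta2, beta3 = 1.0, 0.5, -0.3, 0.8

x2 = rng.normal(size=n)
x1 = 0.6 * x2 + rng.normal(size=n)      # x1 correlated with x2
x3 = 0.4 * x1 + rng.normal(size=n)      # x3 correlated with x1, so bias is expected
u = rng.normal(size=n)
y = beta0 + beta1 * x1 + beta2 * x2 + beta3 * x3 + u

# Short regression: y on (1, x1, x2), omitting x3
X_short = np.column_stack([np.ones(n), x1, x2])
beta_tilde = np.linalg.lstsq(X_short, y, rcond=None)[0]

# Residuals r_i1 from regressing x1 on (1, x2)
X_aux = np.column_stack([np.ones(n), x2])
r1 = x1 - X_aux @ np.linalg.lstsq(X_aux, x1, rcond=None)[0]

# Identity: tilde_beta1 = beta1 + beta3*sum(r1*x3)/sum(r1^2) + sum(r1*u)/sum(r1^2)
identity = beta1 + beta3 * (r1 @ x3) / (r1 @ r1) + (r1 @ u) / (r1 @ r1)
print(beta_tilde[1], identity)          # the two values agree up to rounding error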


Key Concepts

These are the key concepts you need to understand to accurately answer the question.

Ordinary Least Squares (OLS)
Ordinary Least Squares (OLS) is a method used in linear regression to estimate the parameters of a linear model. The goal of OLS is to minimize the sum of the squared differences between the observed values and the values predicted by the model. This method is widely used because it provides the best linear unbiased estimators (BLUE) when certain assumptions are met.

OLS works by fitting a line through data points in a way that the distance from the points to the line is as small as possible. This is often visualized in simple linear regression, where a line is drawn to represent the relationship between a dependent variable and an independent variable.
  • The basic idea is to find coefficients that minimize the discrepancy (residuals) between the actual data values and the values predicted by the linear model.
  • In the context of multiple regression, OLS extends to fitting a plane or a hyperplane in higher dimensions, attempting to capture relationships between one dependent variable and multiple independent variables.
OLS is also foundational to understanding concepts like omitted variable bias, as in the exercise, where one predictor variable is left out of the model.
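As a concrete illustration of the minimization idea, the OLS coefficients of a multiple regression can be computed from the normal equations. The sketch below is an assumption-laden example (made-up data, numpy-based), not a required procedure from the textbook.

```python
import numpy as np

# Minimal sketch: OLS coefficients via the normal equations (illustrative data).
rng = np.random.default_rng(1)
n = 200
x1 = rng.normal(size=n)
x2 = rng.normal(size=n)
y = 2.0 + 1.5 * x1 - 0.7 * x2 + rng.normal(size=n)

X = np.column_stack([np.ones(n), x1, x2])      # design matrix with intercept
beta_hat = np.linalg.solve(X.T @ X, X.T @ y)   # solves (X'X) b = X'y
residuals = y - X @ beta_hat

print(beta_hat)        # close to (2.0, 1.5, -0.7) in this sample
print(residuals @ X)   # residuals are orthogonal to each regressor (approximately 0)
```

The orthogonality of the residuals to every regressor, shown in the last line, is exactly the property used in Step 03 to simplify the sums involving \(\hat{r}_{i1}\).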
Linear Regression Assumptions
For Ordinary Least Squares (OLS) estimators to be considered the best linear unbiased estimators (BLUE), certain assumptions about the linear regression model must be satisfied. These assumptions ensure that the OLS method produces reliable and valid results.
  • Linearity: The relationship between the independent and dependent variables should be linear.
  • Independence: The residuals (errors) should be independent across observations.
  • Homoscedasticity: The variance of the residuals should be constant across all levels of the independent variables.
  • No perfect multicollinearity: No independent variable may be an exact linear combination of the others.
  • Zero conditional mean: The error term has an expected value of zero given any values of the independent variables; this is the assumption that delivers unbiasedness.
In the given exercise, the full model satisfies Assumptions MLR.1 through MLR.4. When \(x_3\) is omitted, its effect \(\beta_{3} x_{3}\) is absorbed into the error term of the short regression. If \(\beta_{3} \neq 0\) and \(x_3\) is correlated with an included regressor, the zero conditional mean assumption fails for that regression, and the estimated coefficients suffer omitted variable bias. A small illustration of the multicollinearity requirement follows this list.
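The "no perfect multicollinearity" requirement can be seen directly in the normal equations: if one regressor is an exact linear function of another, \(X'X\) is singular and no unique OLS solution exists. A minimal sketch with made-up data (an illustration, not from the textbook):

```python
import numpy as np

# Minimal sketch: perfect multicollinearity makes X'X singular (illustrative data).
rng = np.random.default_rng(3)
n = 50
x1 = rng.normal(size=n)
x2 = 3.0 * x1 + 1.0                       # exact linear function of x1
X = np.column_stack([np.ones(n), x1, x2])

print(np.linalg.matrix_rank(X.T @ X))     # 2, not 3: the normal equations have no unique solution
```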
Expected Value of Estimators
The expected value of an estimator is one of the key properties that help us understand its effectiveness. In simple terms, an estimator is a rule or formula that tells us how to calculate an estimate from a given set of data. The expected value of an estimator is its average value over a large number of samples. For an unbiased estimator, this expected value equals the true parameter it estimates.

In the exercise, you were shown how the expected value of \(\tilde{\beta}_1\) is affected by the omission of \(x_3\). Instead of simply representing \(\beta_1\), the expected value of the estimator is shifted by a term that represents the influence of the omitted variable. This is an example of omitted variable bias:
  • Omitted Variable Bias occurs when a relevant variable is left out of the model, causing the estimator to capture not only the effect of the included variable but also the effect of the omitted variable.
  • The formula for \(\mathbf{E}(\tilde{\beta}_1) = \beta_1 + \beta_3 \frac{\sum_{i=1}^{n} \hat{r}_{i1} x_{i3}}{\sum_{i=1}^{n} \hat{r}_{i1}^{2}}\) shows how the bias is directly related to the residuals and the omitted variable.
Understanding the expected value of estimators helps in diagnosing and correcting for biases, leading to more accurate and truthful interpretations of regression analysis results.
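To see the "average over many samples" interpretation in practice, here is a minimal Monte Carlo sketch (illustrative parameter values and numpy usage are assumptions, not from the textbook) that repeatedly draws samples, estimates the short regression that omits \(x_3\), and averages \(\tilde{\beta}_1\) across replications.

```python
import numpy as np

# Minimal Monte Carlo sketch of omitted-variable bias (illustrative values only).
rng = np.random.default_rng(2)
n, reps = 200, 5_000
beta0, beta1, beta2, beta3 = 1.0, 0.5, -0.3, 0.8

estimates = np.empty(reps)
for r in range(reps):
    x2 = rng.normal(size=n)
    x1 = 0.6 * x2 + rng.normal(size=n)
    x3 = 0.4 * x1 + rng.normal(size=n)    # x3 correlated with x1
    y = beta0 + beta1 * x1 + beta2 * x2 + beta3 * x3 + rng.normal(size=n)

    X_short = np.column_stack([np.ones(n), x1, x2])   # x3 omitted
    estimates[r] = np.linalg.lstsq(X_short, y, rcond=None)[0][1]

# The average of tilde_beta1 across samples exceeds beta1 = 0.5 because
# beta3 > 0 and x3 is positively related to x1 after partialling out x2.
print(estimates.mean())
```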


Most popular questions from this chapter

Suppose that you are interested in estimating the ceteris paribus relationship between \(y\) and \(x_{1}\). For this purpose, you can collect data on two control variables, \(x_{2}\) and \(x_{3}\). (For concreteness, you might think of \(y\) as final exam score, \(x_{1}\) as class attendance, \(x_{2}\) as GPA up through the previous semester, and \(x_{3}\) as SAT or ACT score.) Let \(\tilde{\beta}_{1}\) be the simple regression estimate from \(y\) on \(x_{1}\) and let \(\hat{\beta}_{1}\) be the multiple regression estimate from \(y\) on \(x_{1}, x_{2}, x_{3}\). i. If \(x_{1}\) is highly correlated with \(x_{2}\) and \(x_{3}\) in the sample, and \(x_{2}\) and \(x_{3}\) have large partial effects on \(y,\) would you expect \(\tilde{\beta}_{1}\) and \(\hat{\beta}_{1}\) to be similar or very different? Explain. ii. If \(x_{1}\) is almost uncorrelated with \(x_{2}\) and \(x_{3},\) but \(x_{2}\) and \(x_{3}\) are highly correlated, will \(\tilde{\beta}_{1}\) and \(\hat{\beta}_{1}\) tend to be similar or very different? Explain. iii. If \(x_{1}\) is highly correlated with \(x_{2}\) and \(x_{3}\), and \(x_{2}\) and \(x_{3}\) have small partial effects on \(y\), would you expect \(\operatorname{se}\left(\tilde{\beta}_{1}\right)\) or \(\operatorname{se}\left(\hat{\beta}_{1}\right)\) to be smaller? Explain. iv. If \(x_{1}\) is almost uncorrelated with \(x_{2}\) and \(x_{3}, x_{2}\) and \(x_{3}\) have large partial effects on \(y,\) and \(x_{2}\) and \(x_{3}\) are highly correlated, would you expect \(\operatorname{se}\left(\tilde{\beta}_{1}\right)\) or \(\operatorname{se}\left(\hat{\beta}_{1}\right)\) to be smaller? Explain.

Suppose that average worker productivity at manufacturing firms (avgprod) depends on two factors, average hours of training (avgtrain) and average worker ability (avgabil): $$\text { avgprod }=\beta_{0}+\beta_{1} \text { avgtrain }+\beta_{2} \text { avgabil }+u$$ Assume that this equation satisfies the Gauss-Markov assumptions. If grants have been given to firms whose workers have less than average ability, so that avgtrain and avgabil are negatively correlated, what is the likely bias in \(\tilde{\beta}_{1}\) obtained from the simple regression of avgprod on avgtrain?

The following model is a simplified version of the multiple regression model used by Biddle and Hamermesh (1990) to study the tradeoff between time spent sleeping and working and to look at other factors affecting sleep: $$\text { sleep }=\beta_{0}+\beta_{1} \text { totwrk }+\beta_{2} \text { educ }+\beta_{3} \text { age }+u$$ where sleep and totwrk (total work) are measured in minutes per week and educ and age are measured in years. (See also Computer Exercise C3 in Chapter 2.) i. If adults trade off sleep for work, what is the sign of \(\beta_{1}\)? ii. What signs do you think \(\beta_{2}\) and \(\beta_{3}\) will have? iii. Using the data in SLEEP75, the estimated equation is $$\begin{aligned} \widehat{\text { sleep }} &=3,638.25-.148 \text { totwrk }-11.13 \text { educ }+2.20 \text { age } \\ n &=706, R^{2}=.113 . \end{aligned}$$ If someone works five more hours per week, by how many minutes is sleep predicted to fall? Is this a large tradeoff? iv. Discuss the sign and magnitude of the estimated coefficient on educ. v. Would you say totwrk, educ, and age explain much of the variation in sleep? What other factors might affect the time spent sleeping? Are these likely to be correlated with totwrk?

i. Consider the simple regression model \(y=\beta_{0}+\beta_{1} x+u\) under the first four Gauss-Markov assumptions. For some function \(g(x),\) for example \(g(x)=x^{2}\) or \(g(x)=\log \left(1+x^{2}\right),\) define \(z_{i}=g\left(x_{i}\right).\) Define a slope estimator as $$ \tilde{\beta}_{1}=\left(\sum_{i=1}^{n}\left(z_{i}-\bar{z}\right) y_{i}\right) /\left(\sum_{i=1}^{n}\left(z_{i}-\bar{z}\right) x_{i}\right) $$ Show that \(\tilde{\beta}_{1}\) is linear and unbiased. Remember, because \(\mathrm{E}(u | x)=0,\) you can treat both \(x_{i}\) and \(z_{i}\) as nonrandom in your derivation. ii. Add the homoskedasticity assumption, MLR.5. Show that $$ \operatorname{Var}\left(\tilde{\beta}_{1}\right)=\sigma^{2}\left(\sum_{i=1}^{n}\left(z_{i}-\bar{z}\right)^{2}\right) /\left(\sum_{i=1}^{n}\left(z_{i}-\bar{z}\right) x_{i}\right)^{2} $$ iii. Show directly that, under the Gauss-Markov assumptions, \(\operatorname{Var}\left(\hat{\beta}_{1}\right) \leq \operatorname{Var}\left(\tilde{\beta}_{1}\right),\) where \(\hat{\beta}_{1}\) is the OLS estimator. [Hint: The Cauchy-Schwarz inequality in Appendix B implies that $$ \left(n^{-1} \sum_{i=1}^{n}\left(z_{i}-\bar{z}\right)\left(x_{i}-\bar{x}\right)\right)^{2} \leq\left(n^{-1} \sum_{i=1}^{n}\left(z_{i}-\bar{z}\right)^{2}\right)\left(n^{-1} \sum_{i=1}^{n}\left(x_{i}-\bar{x}\right)^{2}\right); $$ notice that we can drop \(\bar{x}\) from the sample covariance.]

Which of the following can cause OLS estimators to be biased? i. Heteroskedasticity. ii. Omitting an important variable. iii. A sample correlation coefficient of .95 between two independent variables both included in the model.
