Warning: foreach() argument must be of type array|object, bool given in /var/www/html/web/app/themes/studypress-core-theme/template-parts/header/mobile-offcanvas.php on line 20

Let \(\overline x = \frac{1}{n}\left( {{x_1} + \cdots + {x_n}} \right)\), and \(\overline y = \frac{1}{n}\left( {{y_1} + \cdots + {y_n}} \right)\). Show that the least-squares line for the data \(\left( {{x_1},{y_1}} \right), \ldots ,\left( {{x_n},{y_n}} \right)\) must pass through \(\left( {\overline x ,\overline y } \right)\). That is, show that \(\overline x \) and \(\overline y \) satisfies the linear equation \(\overline y = {\hat \beta _0} + {\hat \beta _1}\overline x \). (Hint: Derive this equation from the vector equation \({\bf{y}} = X{\bf{\hat \beta }} + \in \). Denote the first column of \(X\) by 1. Use the fact that the residual vector \( \in \) is orthogonal to the column space of \(X\) and hence is orthogonal to 1.)

Short Answer

Expert verified

It is verified that, \(\bar y\) and \(\) satisfies the linear equation \(\bar y = {\beta _0} + \bar x{\beta _1}\).

Step by step solution

01

The General Linear Model

The equation of the general linear model is defined as:

\({\bf{y}} = X\beta + \in \)

Here, \({\bf{y}} = \left( {\begin{aligned}{{y_1}}\\{{y_2}}\\ \vdots \\{{y_n}}\end{aligned}} \right)\) is an observational vector, \(X = \left( {\begin{aligned}1&{{x_1}}& \cdots &{x_1^n}\\1&{{x_2}}& \cdots &{x_2^n}\\ \vdots & \vdots & \ddots & \vdots \\1&{{x_n}}& \cdots &{x_n^n}\end{aligned}} \right)\) is the design matrix, \(\beta = \left( {\begin{aligned}{{\beta _1}}\\{{\beta _2}}\\ \vdots \\{{\beta _n}}\end{aligned}} \right)\) is parameter vector, and \( \in = \left( {\begin{aligned}{{ \in _1}}\\{{ \in _2}}\\ \vdots \\{{ \in _n}}\end{aligned}} \right)\) is a residual vector.

02

Determine whether \(\overline x \) and \(\overline y \) satisfies the given linear equation 

Write the design matrix as\(X = \left( {\begin{aligned}1&{\bf{x}}\end{aligned}} \right)\).

As the equation of the general linear model is given by\({\bf{y}} = X\beta + \in \). Isolate residual vector from the equation.

\( \in = {\bf{y}} - X{\bf{\hat \beta }}\)

As the residual vector is orthogonal to columns of \(X\), so \(1 \cdot \in = 0\).

\(\begin{aligned}1 \cdot \left( {{\bf{y}} - X{\bf{\hat \beta }}} \right) = 0\\{1^T}{\bf{y}} - \left( {{1^T}X} \right){\bf{\hat \beta }} = 0\\\left( {\begin{aligned}1\\1\\ \vdots \\1\end{aligned}} \right)\left( {\begin{aligned}{{y_1}}\\{{y_2}}\\ \vdots \\{{y_n}}\end{aligned}} \right) - {\left( {\begin{aligned}1\\1\\ \vdots \\1\end{aligned}} \right)^T}\left( {\begin{aligned}{{x_1}}\\{{x_2}}\\ \vdots \\{{x_n}}\end{aligned}} \right) \cdot {\bf{\hat \beta }} = 0\\\left( {{y_1} + {y_2} + \cdots + {y_n}} \right) - \left( {\begin{aligned}{1 + 1 + \ldots n{\rm{ times}}}&{{x_1} + {x_2} + \cdots + {x_n}}\end{aligned}} \right) \cdot {\bf{\hat \beta }} = 0\\\sum y - \left( {\begin{aligned}n&{\sum x }\end{aligned}} \right)\left( {\begin{aligned}{{\beta _0}}\\{{\beta _1}}\end{aligned}} \right) = 0\\\sum y - \left( {n{\beta _0} + \sum x {\beta _1}} \right) = 0\end{aligned}\)

Write \(\sum y \) as \(n\bar y\) and \(\sum x \) as \(n\bar x\).

\(\begin{aligned}n\bar y - \left( {n{\beta _0} + n\bar x{\beta _1}} \right) = 0\\n\bar y - n{\beta _0} - n\bar x{\beta _1} = 0\\\bar y - {\beta _0} - \bar x{\beta _1} = 0\\\bar y = {\beta _0} + \bar x{\beta _1}\end{aligned}\)

This implies that \(\bar y\) and \(\bar x\) satisfies the linear equation \(\bar y = {\beta _0} + \bar x{\beta _1}\).

Unlock Step-by-Step Solutions & Ace Your Exams!

  • Full Textbook Solutions

    Get detailed explanations and key concepts

  • Unlimited Al creation

    Al flashcards, explanations, exams and more...

  • Ads-free access

    To over 500 millions flashcards

  • Money-back guarantee

    We refund you if you fail your exam.

Over 30 million students worldwide already upgrade their learning with Vaia!

One App. One Place for Learning.

All the tools & learning materials you need for study success - in one app.

Get started for free

Most popular questions from this chapter

Find a \(QR\) factorization of the matrix in Exercise 11.

In Exercises 1-4, find the equation \(y = {\beta _0} + {\beta _1}x\) of the least-square line that best fits the given data points.

  1. \(\left( { - 1,0} \right),\left( {0,1} \right),\left( {1,2} \right),\left( {2,4} \right)\)

A simple curve that often makes a good model for the variable costs of a company, a function of the sales level \(x\), has the form \(y = {\beta _1}x + {\beta _2}{x^2} + {\beta _3}{x^3}\). There is no constant term because fixed costs are not included.

a. Give the design matrix and the parameter vector for the linear model that leads to a least-squares fit of the equation above, with data \(\left( {{x_1},{y_1}} \right), \ldots ,\left( {{x_n},{y_n}} \right)\).

b. Find the least-squares curve of the form above to fit the data \(\left( {4,1.58} \right),\left( {6,2.08} \right),\left( {8,2.5} \right),\left( {10,2.8} \right),\left( {12,3.1} \right),\left( {14,3.4} \right),\left( {16,3.8} \right)\) and \(\left( {18,4.32} \right)\), with values in thousands. If possible, produce a graph that shows the data points and the graph of the cubic approximation.

In exercises 1-6, determine which sets of vectors are orthogonal.

\(\left[ {\begin{array}{*{20}{c}}5\\{ - 4}\\0\\3\end{array}} \right]\), \(\left[ {\begin{array}{*{20}{c}}{ - 4}\\1\\{ - 3}\\8\end{array}} \right]\), \(\left[ {\begin{array}{*{20}{c}}3\\3\\5\\{ - 1}\end{array}} \right]\)

Let \(X\) be the design matrix in Example 2 corresponding to a least-square fit of parabola to data \(\left( {{x_1},{y_1}} \right), \ldots ,\left( {{x_n},{y_n}} \right)\). Suppose \({x_1}\), \({x_2}\) and \({x_3}\) are distinct. Explain why there is only one parabola that best, in a least-square sense. (See Exercise 5.)

See all solutions

Recommended explanations on Math Textbooks

View all explanations

What do you think about this solution?

We value your feedback to improve our textbook solutions.

Study anywhere. Anytime. Across all devices.

Sign-up for free