Regression Through Origin

Table of Contents

Introduction of Regression Through Origin Models

So far we have studied models like

$Y_i = \beta_0 + \beta_1 X_i + u_i$

Where intercept is present. An economic example of these models is the Keynes consumption function written as:

$\text{Consumption} = \beta_0 + \beta_1 \text{Income} + u$

Where $\beta_0$ is autonomous consumption i.e., level of consumption when income is zero.

In some cases, we wish to impose the restriction that, when x=0, the expected value of y is zero. There are certain relationships for which this is reasonable. For example, in the following model where tax revenue depends on income:

$\text{Tax Revenue} = \beta_1 \text{Income} + u$

If income (X) is zero, then income tax revenues (Y) must also be zero.

Consider another example from production theory:

$\text{Variable Cost} = \beta_1 \text{Output} + u$

If output is zero (X=0), the variable cost will also be zero (Y=0).

Definition of Regression Through Origin Models

A regression model in which intercept term ( ) is absent or zero is called regression through origin model because it passes through the origin i.e., where X=0, and Y=0. It can be written as:

$Y_i = \beta_1 X_i + u_i$

Economic Examples of Regression Through Origin Models

Instances where the zero-intercept model may be appropriate are

Milton Friedman’s permanent income hypothesis, which states that permanent consumption $C_P$ is proportional to permanent income, $Y_P$ , that is $C_P = k Y_P + u$ .
Cost theory which postulates that the variable cost of production is proportional to output.
Some versions of monetary theory which state that the rate of change of prices (i.e., the rate of inflation) is proportional to the rate of change of the money supply.

Estimation of Regression Through Origin Models

Corresponding to equation 1 we can write our sample regression function as.

$Y_i = \hat{\beta}_1 X_i + \hat{u}_i$

$\hat{Y}_i = \hat{\beta}_1 X_i$

To obtain the slope coefficient, we still rely on the method of ordinary least squares, which in this case minimizes the sum of squared residuals.

$\hat{\beta}_1 = \frac{\sum X_i Y_i}{\sum X_i^2}$

$\text{Var}(\hat{\beta}_1) = \frac{\sigma^2}{\sum X_i^2}$

And

$\hat{\sigma}^2 = \frac{\sum \hat{u}_i^2}{n - 1}$

It is interesting to compare these formulas with those obtained when the intercept term is included in the model.

$\hat{\beta}_1 = \frac{\sum x_i y_i}{\sum x_i^2}$

$\text{Var}(\hat{\beta}_1) = \frac{\sigma^2}{\sum x_i^2}$

$\hat{\sigma}^2 = \frac{\sum \hat{u}_i^2}{n - 2}$

Difference between Intercept and without Intercept Models

In the model with no intercept, we use raw sums of squares and cross products but in the model with intercept, we use deviations from mean sums of squares and cross products.
The degrees of freedom in model with intercept to estimate the variance of residual is n-2, but df in the model without intercept is n-1.
The r², the coefficient of determination is always non-negative for the conventional model, but for interceptless model it can turn out to be negative! This anomalous result arises because the r² explicitly assumes that the intercept is included in the model.

Consequences of using Zero Intercept Models.

In models with intercept sum of residuals is always zero but in models without intercept zero mean of residual is not necessary. Thus, we must use the interceptless models only when it is appropriate.
If we omit the constant term, then the impact of the constant is forced into the estimates of the other coefficients, causing potential bias.

Example:

Suppose we run a regression model both with and without intercept. Estimating a regression equation with a constant term would likely produce an estimated regression line very similar to the true regression line, which has a constant term quite different from zero. The slope of this estimated line is very low, and the t-score of the estimated slope coefficient may be very close to zero.

However, if the researcher estimates the model without intercept, which implies that the estimated regression line must pass through the origin, then the estimated regression line would result the slope coefficient is biased upward compared with the true slope coefficient. The t-score is biased upward as well, and it may indicate that the estimated slope coefficient is significantly positive. Such a conclusion would be incorrect.

Regression with and without intercept.

Coefficient of Determination for Regression Through Origin Model.

To calculate r²for models without intercept we can use the following formula.

$R^2_{\text{raw}} = \frac{\left(\sum X_i Y_i\right)^2}{\left(\sum X_i^2\right)\left(\sum Y_i^2\right)}$

Note that here cross products and sum of squares are in raw form i.e., not mean corrected that`s why we call r² of regression through origin as raw r².

Although this raw r² satisfies the relation 0 < r² < 1, it is not directly comparable to the conventional r² value. For this reason, some authors do not report the r² value for zero intercept regression models.

Conclusion

Because of these special features of this model, we need great caution in using the zero-intercept regression model. Unless there is a very priori expectation, one would be advised to use the conventional, intercept-present model. This has a dual advantage.

First, if the intercept term is included in the model but it turns out to be statistically insignificant (i.e., statistically equal to zero), for all practical purposes we have a regression through the origin.
Second, and more important, if in fact there is an intercept in the model, but we insist on fitting a regression through the origin, we would be committing a specification error.

Share this article

Muhammad Minhaj Akhtar

Muhammad Minhaj Akhtar is a Lecturer in Economics at Government Graduate College Jauharabad, Pakistan. He holds an M.Phil. in Economics from Quaid-i-Azam University, Islamabad, and an MSc in Economics from the University of Sargodha, where he earned a Silver Medal. His academic passion lies in Econometrics, with a strong focus on applying empirical methods to real-world economic issues. Through MinhajMetrixHub, he shares learning resources, research guidance, and practical econometric insights for students and researchers.

Read Posts

Regression Through Origin

Introduction of Regression Through Origin Models

Definition of Regression Through Origin Models

Economic Examples of Regression Through Origin Models

Estimation of Regression Through Origin Models

Difference between Intercept and without Intercept Models

Consequences of using Zero Intercept Models.

Example:

Coefficient of Determination for Regression Through Origin Model.

Conclusion

Share this article

Muhammad Minhaj Akhtar

Leave a Reply Cancel reply

CATEGORIES

TAGS

Recent Posts

Sen Capability Approach

Variance and Standard Error of OLS Estimators and Gauss-Markov Theorem

Assumptions of Classical Linear Regression Model (CLRM)

Education and Economic Development

A Numerical Example of Multiple Linear Regression by Hand