Model Misspecification
- Omitted variables
- Inappropriate form of variables
- Inappropriate variable scaling
- Inappropriate data pooling
Vialations of Regesstion Assumtions
Heteroskedasticity
- Unconditional heteroskedasticity: Heteroskedasticity of the error variance is not correlated with independent variables. Creates no major problems for statistical inference.
- Conditional heteroskedasticity: Heteroskedasticity of the error variance is correlated with the values of independent variables.
In this case, the coefficient esitimates
To test heteroskedasticity, we can use Breusch-Pagen
where
To correct Heteroskedasticity, we use adjusted robust standard errors to recalculate
Serial Correlation
The error term are correlated with another, typically in time series. Positive serial correlation will increase the chance of error term in same sign, vice versa for negative.
- Positive serial correlation: standard errors underestimated and t-statistics inflated
- Negative serial correlation: Overestimated standard errors and underestimated t-statistics
To test serial correlation, we use Durbin-Watson test:
- The result is in
. If near 4, negative serial correlation, If near 0, positive, If near 2, no serial correlation. - It is limited to first order serial correlation.
The Bresusch-Godfrey test:
or it can be extended with higher orders of
To correct serial correlation, we use adjusted Newey-West standard errors or modify the regression itself.
Multicolinearity
It refers to cases where 2 or more independent variables are highly but not perfectly correlated to each other. Estimates of correlation become inprecise and reliable.
- Similar to negative serial correlation, standard errors overestimated.
To test, we use VIF:
- The smaller the better
- If
, further investigation - If
, multicollinearity
To correct:
- Exclude variables
- Increase sample size