COMPARATIVE ANALYSIS OF RIDGE AND PRINCIPAL COMPONENT REGRESSION IN ADDRESSING MULTICOLLINEARITY
Abstract
Multicollinearity arises when two or more regressors are correlated in multiple linear regression model (MLRM) and in most cases, one regressor variable can be predicted from another. Multicollinearity majorly results in inefficient regression model estimates and poor performance of the regression model. However, multicollinearity problem can easily be handled using various methods such as ridge regression, lasso regression, principal components regression, etc. This study compared the effectiveness of two estimators in handling multicollinearity problem in a given dataset. The estimators being compared are ridge estimator (RE) and principal components estimator (PCE). This research uses secondary data obtained from World Bank database, International Monetary Fund (IMF) database, and the Nigerian Debt Management Office to compare the two approaches of handling multicollinearity problem in MLRM. The presence of multicollinearity in the dataset was established using the correlation matrix of predictors and the Variance Inflation Factors (VIF's). Then ridge regression and principal components regression methods were used to fit models to the dataset respectively and their mean squared errors (MSE) were obtained. The MSE was used as performance evaluation measure for the regression models. Both methods addressed the problem multicollinearity in the datasets but the ridge estimator performed better than PCE by having the smallest mean squared error.
References
Arum, K. C., & Ugwuowo, F. I. (2022). Combining principal component and robust ridge estimators in linear regression model with multicollinearity. Concurrency and Computation Practice and Experience, 34 (10), 12 20. https://doi.org/10.1002/cpe.6803 DOI: https://doi.org/10.1002/cpe.6803
Ayinde, K., Ugochinyere N., & Olusegun O. (2021). Solving multicollinearity problem in linear regression model: The review suggests new idea of partitioning and extraction of the explanatory variables. Journal of Mathematics and Statistics Studies, 2 (1), 12 20. DOI: https://doi.org/10.32996/jmss.2021.2.1.2
El-Dereny, M., & Rashwan, N. I. (2011): Solving multicollinearity problem using ridge regression models. International Journal of Contemporary Mathematical Sciences, 6(12), 585-600.
Hoerl A. E., & Kennard R.W (1970). Ridge Regression: Applications to Non-Orthogonal Problems. Journal of Technometrics, 12 (1), 69-82. DOI: https://doi.org/10.1080/00401706.1970.10488635
Jegede, S. L., Lukman, A. F., Ayinde, K., & Odeniyi K. A. (2022). Jackknife Kibria-Lukman M-Estimator: Simulation and Application. DOI: https://doi.org/10.46481/jnsps.2022.664
Kibria, G. B. M., & Lukman, A. F. (2020). A new ridge-type estimator for the linear regression model: Simulations and applications. Scientifica, 2020, 116. https://doi.org/10.1155/2020/9758378 DOI: https://doi.org/10.1155/2020/9758378
Lukman, A.F., Ayinde K., Olusegun O., & Hamidu A. (2020). A new approach of principal component regression with applications to collinear data. International Journal of Engineering Research and Technology, 13 (7), 1616 1622. DOI: https://doi.org/10.37624/IJERT/13.7.2020.1616-1622
Massy, W.F. (1965). Principal Components Regression in exploratory statistical research. Journal of the American. Statistical. Association. 60, 234-256. DOI: https://doi.org/10.1080/01621459.1965.10480787
Montgomery, D. C. Peck E. A., & Vining, G.G. (2012). Introduction to linear regression analysis. (3rd edition). Wiley and Sons Inc. New York.
Zhang, J., & Ibrahim M. (2020). A simulation study on SPSS ridge regression and ordinary least squares regression procedures for multicollinearity data. Journal of Applied Statistics, 32 (6), 571 588. DOI: https://doi.org/10.1080/02664760500078946
Copyright (c) 2025 FUDMA JOURNAL OF SCIENCES
![Creative Commons License](http://i.creativecommons.org/l/by/4.0/88x31.png)
This work is licensed under a Creative Commons Attribution 4.0 International License.
FUDMA Journal of Sciences