CREDIT RISK ANALYSIS: AN ASSESSMENT OF THE PERFORMANCE OF SIX MACHINE LEARNING TECHNIQUES IN CREDIT SCORING MODELLING
Abstract
This study checked the credit risk analysis domain, concentrating on assessing the efficacy of six distinct credit scoring methodologies: linear discriminant analysis, logistic regression, artificial neural networks, support vector machine, decision tree and, K-nearest neighbour on microcredit applicant’s data. Two performance metrics were used: Area under the receiver operative characteristic curve and, Precision. The results obtained from the experimentation phase reveal distinct performance levels for each technique. Specifically, K-nearest neighbour and artificial neural networks showcase exceptional prowess, yielding an AUC of 0.9833 and 0.9062 and, an impressive precision score of 0.8065 and 1 respectively. In contrast, logistic regression and support vector machine demonstrate a good performance with an area under the curve value of 0.8537 and 0.8532 respectively, on precision metric score, support vector machine showed impressive high performance while logistic regression performed poorly. Linear discriminant analysis and Decision tree exhibit comparatively moderate accuracy scores and achieved an AUC of 0.8494318 and 0.7524 respectively. Thus, we underscore the potential of K-nearest neighbour and Artificial neural networks as a superior method for credit risk analysis, supported by robust performance metrics. Although, all techniques achieve significantly good discriminative power and good precision. The findings advocate for the adoption of modern techniques in credit scoring modelling, positioning K-nearest neighbour and Artificial neural networks as a valuable tool in financial institutions’ risk assessment processes.
References
Baesens, B., Setiono, R., Mues, C., & Vanthienen, J. (2003). Using Neural Network Rule Extraction and Decision Tables for Credit-Risk Evaluation. Journal of Operational research and the management science, 312-329. DOI: https://doi.org/10.1287/mnsc.49.3.312.12739
Bank for International Settlement (2010). Basel Committee on Banking Supervision Publication (BCBS Publication) https://www.bis.org/bcbs/publications.htm
Bekesiene, S., Smaliukiene, R., & Vaicaitiene, R. (2021). Using artificial neural networks in predicting the level of stress among military conscripts. Mathematics, 9(6). https://doi.org/10.3390/math9060626 DOI: https://doi.org/10.3390/math9060626
Coffman, J. (1986). The proper role of tree analysis in forecasting the risk behavior of borrowers. Journal of Management Decision Systems, Atlanta, MDS Reports, 3 (4), 7.
Desai, V. S., Crook, J. N., & Overstreet Jr, G. A. (1996). A comparison of neural net- works and linear scoring models in the credit union environment. European Journal of Operational Research, 95 (1), 2437. DOI: https://doi.org/10.1016/0377-2217(95)00246-4
Fisher, R.A. (1936). The Use of Multiple Measurements in Taxonomic Problems. Annals of Eugenics, 7,179-188. http://dx.doi.org/10.1111/j.1469-1809.1936.tb02137.x DOI: https://doi.org/10.1111/j.1469-1809.1936.tb02137.x
Garson, G.D. (1991). Interpreting neural network connection weights. Articial Intelligence Expert, Vol.6, No.4, pp.46-51.
Han, J. and Kamber, M. (2006). Data Mining: Concepts and Techniques (2nd ed.). San Franscisco: Elsevier, 770 pages.
Henley, W., & Hand, D. (1996). A k-nearest neighbor classifier for assessing consumer credit risk. The Statistician, 45 (1), 7795. DOI: https://doi.org/10.2307/2348414
Henley, W. E., & Hand, D. J. (1997). Construction of a k-nearest neighbor credit-scoring system. IMA Journal of Mathematics Applied in Business and Industry, 8, 305321. DOI: https://doi.org/10.1093/imaman/8.4.305
Hosmer, D., & Lemeshow, S. (1989). Applied Logistic Regression. New York: John Wiley & Sons, Inc.
Jha, G.K. (2007). Artificial neural networks and its application. IARI, NewDehli, girish_iasri@rediffmail.com, pp. 1-10.
Lahsasna, A., Ainon, R.N., Wah, Y. T.,(2010): Credit Scoring Models Using Soft Computing Methods: A Survey. International Arab Journal of Information Technology 7(2):115-123 https://www.researchgate.net/journal/International-Arab-Journal-of-Information-Technology-1683-3198
Lewis, R. J. (2000, May). An introduction to classification and regression tree (CART) analysis. In Annual meeting of the society for academic emergency medicine in San Francisco, California (Vol. 14). San Francisco, CA, USA: Department of Emergency Medicine Harbor-UCLA Medical Center Torrance.
Makowski, P. (1985). Credit scoring branches out. Credit World, 75 (1), 3037.
May, R., Dandy, G. and Maier, H. (2011). Review of Input Variable Selection Methods for Artificial Neural Networks. Artificial Neural Networks- Methodological Advances and Biomedical Applications, Prof. Kenji Suzuki (Ed.), InTech, DOI: 10.5772/16004. [online] Available from: http://www.intechopen.com/books/artificial-neural-networks-methodological-advances-and-biomedical/-applications-review-of-input-variable-selection-methods/-for-artificial-neural-networks, [4 February 2017]. DOI: https://doi.org/10.5772/16004
Park, H.A. (2013) An Introduction to Logistic Regression: From Basic Concepts to Interpretation with Particular Attention to Nursing Domain. Journal of Korean Academy of Nursing. 43(2):154-164 . https://doi.org/10.4040/jkan.2013.43.2.154. DOI: https://doi.org/10.4040/jkan.2013.43.2.154
Peterson, L. E. (2009). K-nearest neighbor. Scholarpedia, 4(2), 1883. DOI: https://doi.org/10.4249/scholarpedia.1883
Srinivasan, V., & Kim, Y. H. (1987). Credit granting: A comparative analysis of classification procedures. The Journal of Finance, 42 (3), 665681. DOI: https://doi.org/10.2307/2328378
Sandri, M., & Zuccolotto, P. (2010). Analysis and correction of bias in total decrease in node impurity measures for treebased algorithms. Statistics and Computing, 20, 393-407 DOI: https://doi.org/10.1007/s11222-009-9132-0
Song, Y. Y., & Ying, L. U. (2015). Decision tree methods: applications for classification and prediction. Shanghai archives of psychiatry, 27(2), 130.
Tabachnick, B.G., and Fidell , L. S. (1996), "Using multivariate statistics", (3rd ed.), New York: Harper Collins publishers.
Tam, K. Y., & Kiang, M. Y. (1992). Managerial applications of neural networks: The case of bank failure predictions. Journal of Management Science, 38 (7), 926947. DOI: https://doi.org/10.1287/mnsc.38.7.926
United Nations Sustainable development goals, goal 10. https://sdgs.un.org/goals/goal10
Veall, M. and Zimmermann, K.(1996). Evaluating Pseudo-R2's for Binary Probit Models. Quality & Quantity: International Journal of Methodology, Springer, vol. 28(2), pages 151-164, May. DOI: https://doi.org/10.1007/BF01102759 DOI: https://doi.org/10.1007/BF01102759
West, D. (2000). Neural network credit scoring models. Computers & Operations Re- search, 27 (1112), 11311152. DOI: https://doi.org/10.1016/S0305-0548(99)00149-5
Wu, J., Tennyson, R.D. and, Hsia, T. (2010). A study of student satisfaction in a blended e-learning system environment. Journal of Computers & Education 55(1):155-164 http://dx.doi.org/10.1016/j.compedu.2009.12.012 DOI: https://doi.org/10.1016/j.compedu.2009.12.012
Yap, B., Ong, S., and Husain, N. (2011). Using data mining to improve assessment of credit worthiness via credit scoring models. Expert Systems with Application DOI: https://doi.org/10.1016/j.eswa.2011.04.147
Yobas, M. B., Crook, J. N., & Ross, P. (2000). Credit scoring using neural and evolutionary techniques. IMA Journal of Mathematics Applied in Business and Industry, 11, 111125. DOI: https://doi.org/10.1093/imaman/11.2.111
Copyright (c) 2024 FUDMA JOURNAL OF SCIENCES
This work is licensed under a Creative Commons Attribution 4.0 International License.
FUDMA Journal of Sciences