INTEGRATING RANDOM FOREST AND LOGISTIC REGRESSION TO FORECAST CUSTOMER ATTRITION IN THE TELECOM SECTOR

  • Ayoade A. Owoade Tai Solarin University of Education, Ijebu Ode
  • Olugbenga I. Bakare Tai Solarin University of Education, Ijebu Ode
  • Gbenga O. Ogunsanwo Tai Solarin University of Education, Ijebu Ode
Keywords: Random Forest, Machine Learning, Logistic Regression, Customer Attrition

Abstract

Since customers are the foundation of any successful business, companies must prioritize making sure they are satisfied. However, due to increased corporate competition, the importance of customers’ and marketing tactics more informed conduct in the past few years, client attrition is a significant problem and is acknowledged as one of the top concerns among businesses. Organizations must take a number of measures to address the problems with churn brought on by the services they offer. Customer attrition strategies are essential in the fiercely competitive and rapidly changing telecom industry. Utilizing machine learning methods, assess the possibility that a client will leave a firm. This research uses logistic regression, random forest, and big data to predict customer attrition in the telecom sector. A large-scale logistic regression analysis has been used to assess the probability of churn as a function of a variable set or customer attribute. Similarly, based on how close a feature is to customers in each class, random forest is employed to ascertain if or not a customer churns. This research makes use of information from the Kaggle website to forecast and examine churn. According to the results of the study show that 0.84 percent is the area under the curve., and the forecast precision rates for consumer churn using linear regression and random forest are 0.80 and 0.79 percent, respectively.

References

Prabadevi, B., Shalini, R., & Kavitha, B. R. (2023). Customer churning analysis using machine learning algorithms. International Journal of Intelligent Networks, 4. DOI: https://doi.org/10.1016/j.ijin.2023.05.005

Zhao, H., Yao, X., Liu, Z., & Yang, Q. (2021). Impact of pricing and product information on consumer buying behavior with customer satisfaction in a mediating role. Frontiers in Psychology, 12(1). frontiersin. DOI: https://doi.org/10.3389/fpsyg.2021.720151

Srinivasan, R., Rajeswari, D., & Elangovan, G. (2023, January 1). Customer Churn Prediction Using Machine Learning Approaches. IEEE Xplore. DOI: https://doi.org/10.1109/ICECONF57129.2023.10083813

Karamollaolu, H., Yceda, ., & Doru, . A. (2021, September 1). Customer Churn Prediction Using Machine Learning Methods: A Comparative Analysis. IEEE Xplore. DOI: https://doi.org/10.1109/UBMK52708.2021.9558876

Shields, K. (2021). Chapter 3: Managing a Customer Service Team. Ecampusontario.pressbooks.pub, 3(5).

Wagh, S. K., Andhale, A. A., Wagh, K. S., Pansare, J. R., Ambadekar, S. P., & Gawande, S. H. (2023). Customer Churn Prediction in Telecom Sector using Machine Learning Techniques. Results in Control and Optimization, 14, 100342. DOI: https://doi.org/10.1016/j.rico.2023.100342

Ahmad, A. K., Jafar, A., & Aljoumaa, K. (2019). Customer churn prediction in telecom using machine learning in big data platform. Journal of Big Data, 6(1). DOI: https://doi.org/10.1186/s40537-019-0191-6

Sana, J. K., Abedin, M. Z., Rahman, M. S., & Rahman, M. S. (2022). A novel customer churn prediction model for the telecommunication industry using data transformation methods and feature selection. PLOS ONE, 17(12) DOI: https://doi.org/10.1371/journal.pone.0278095

Ribeiro, H., Barbosa, B., Moreira, A. C., & Rodrigues, R. G. (2023). Determinants of churn in telecommunication services: a systematic literature review. Management Review Quarterly. DOI: https://doi.org/10.1007/s11301-023-00335-7

Petropoulos, F. (2022). Forecasting: Theory and practice. International Journal of Forecasting, 38(3). sciencedirect. DOI: https://doi.org/10.1016/j.ijforecast.2021.11.001

Mandal, P. C. (2023). Engaging Customers and Managing Customer Relationships. Journal of Business Ecosystems, 4(1), 114. DOI: https://doi.org/10.4018/JBE.322405

Shobana, J., Gangadhar, Ch., Arora, R. K., Renjith, P. N., Bamini, J., & Chincholkar, Y. devidas. (2023). E-commerce customer churn prevention using machine learning-based business intelligence strategy. Measurement: Sensors, 27 DOI: https://doi.org/10.1016/j.measen.2023.100728

Aghaabbasi, M., & Chalermpong, S. (2023). Machine learning techniques for evaluating the nonlinear link between built-environment characteristics and travel behaviors: A systematic review. Travel Behaviour and Society, 33. DOI: https://doi.org/10.1016/j.tbs.2023.100640

Saghir, M., Bibi, Z., Bashir, S., & Khan, F. H. (2019). Churn Prediction using Neural Network based Individual and Ensemble Models. 2019 16th International Bhurban Conference on Applied Sciences and Technology (IBCAST). DOI: https://doi.org/10.1109/IBCAST.2019.8667113

Sarker, I. H. (2021). Machine Learning: Algorithms, Real-World Applications and Research Directions. SN Computer Science, 2(3), 121. Springer. https://doi.org/10.1007/s42979-021-00592-x DOI: https://doi.org/10.1007/s42979-021-00592-x

Lu, Y. L., Zhelavskaya, I. S., & Wang, C. (2022). Neural Decision Tree: A New Tool for Building Forecast Models for Plasmasphere Dynamics. Earth and Space Science, 9(7). DOI: https://doi.org/10.1029/2021EA002175

Zhao, M., Zeng, Q., Chang, M., Tong, Q., & Su, J. (2021). A Prediction Model of Customer Churn considering Customer Value: An Empirical Research of Telecom Industry in China. Discrete Dynamics in Nature and Society, 2021, 112. hindawi. https://doi.org/10.1155/2021/7160527 DOI: https://doi.org/10.1155/2021/7160527

Nguyen, N. Y., Tran, L. V., & Dao, S. V. T. (2022). Churn prediction in telecommunication industry using kernel Support Vector Machines. PLOS ONE, 17(5). DOI: https://doi.org/10.1371/journal.pone.0267935

Wibawa, A. P., Kurniawan, A. C., Murti, D. M. P., Adiperkasa, R. P., Putra, S. M., Kurniawan, S. A., & Nugraha, Y. R. (2019). Nave Bayes Classifier for Journal Quartile Classification. International Journal of Recent Contributions from Engineering, Science & IT (IJES), 7(2), 91. DOI: https://doi.org/10.3991/ijes.v7i2.10659

Dar, S. A. (2022). The Relevance of Taylors Scientific Management in the Modern Era. Journal of Psychology and Political Science (JPPS) ISSN 2799-1024, 2(06), 16.

Michele, P., Fallucchi, F., & De Luca, E. W. (2019). Create Dashboards and Data Story with the Data & Analytics Frameworks. Metadata and Semantic Research, 272283. DOI: https://doi.org/10.1007/978-3-030-36599-8_24

Dash, S., Shakyawar, S. K., Sharma, M., & Kaushik, S. (2019). Big data in healthcare: Management, analysis and future prospects. Journal of Big Data, 6(1), 125.

Ku-Mahamud, K. R. & Basim Alwan, H. (2020). Big data: definition, characteristics, life cycle, applications, and challenges. IOP Conference Series: Materials Science and Engineering, 769, 012007. https://doi.org/10.1088/1757-899x/769/1/012007 DOI: https://doi.org/10.1088/1757-899X/769/1/012007

Favaretto, M., De Clercq, E., Schneble, C. O., & Elger, B. S. (2020). What is your definition of Big Data? Researchers understanding of the phenomenon of the decade. PLOS ONE, 15(2), DOI: https://doi.org/10.1371/journal.pone.0228987

Islam, Md. A. (2024). Impact of Big Data Analytics on Digital Marketing: Academic Review. Journal of Electrical Systems, 20(5s), 786820. https://doi.org/10.52783/jes.2327 DOI: https://doi.org/10.52783/jes.2327

Dash, S., Shakyawar, S. K., Sharma, M., & Kaushik, S. (2019). Big data in healthcare: Management, analysis and future prospects. Journal of Big Data, 6(1), 125. springer. DOI: https://doi.org/10.1186/s40537-019-0217-0

Batko, K., & lzak, A. (2022). The Use of Big Data Analytics in Healthcare. Journal of Big Data, 9(1). https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8733917/ DOI: https://doi.org/10.1186/s40537-021-00553-4

Roberts, S. (2023). What are the 3Vs of Big Data? Www.theknowledgeacademy.com. https://www.theknowledgeacademy.com/blog/big-data-3v/

Jabeen, H. (2020). Chapter 3 Big Data Outlook, Tools, and Architectures. Lecture Notes in Computer Science, 3555. https://doi.org/10.1007/978-3-030-53199-7_3 DOI: https://doi.org/10.1007/978-3-030-53199-7_3

Berisha, B., Meziu, E., & Shabani, I. (2022). Big data analytics in Cloud computing: an overview. Journal of Cloud Computing, 11(1). DOI: https://doi.org/10.1186/s13677-022-00301-w

Cavlak, N., & Cop, R. (2021). The Role of Big Data in Digital Marketing. Advances in Marketing, Customer Relationship Management, and E-Services, 1633. https://doi.org/10.4018/978-1-7998-8003-5.ch002 DOI: https://doi.org/10.4018/978-1-7998-8003-5.ch002

Udeh, C. A., Orieno, O. H., Daraojimba, O. D., Ndubuisi, N. L., & Oriekhoe, O. I. (2024). Big Data Analytics: A Review of Its Transformative Role in Modern Business Intelligence. Computer Science & IT Research Journal, 5(1), 219236. DOI: https://doi.org/10.51594/csitrj.v5i1.718

Probst, P., Wright, M. N., & Boulesteix, A. (2019). Hyperparameters and tuning strategies for random forest. Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 9(3). DOI: https://doi.org/10.1002/widm.1301

Qin, Z., Liu, Y., & Zhang, T. (2022). Research on Early Warning of Customer Churn Based on Random Forest. Journal on Artificial Intelligence, 4(3), 143154. DOI: https://doi.org/10.32604/jai.2022.031843

Sultan, A., Saabun, W., Faizi, S., & Ismail, M. (2021). Hesitant Fuzzy Linear Regression Model for Decision Making. Symmetry, 13(10), 1846. DOI: https://doi.org/10.3390/sym13101846

Jain, H., Khunteta, A., & Srivastava, S. (2020). Churn Prediction in Telecommunication using Logistic Regression and Logit Boost. Procedia Computer Science, 167, 101112. DOI: https://doi.org/10.1016/j.procs.2020.03.187

Mooney, M. A., Neighbor, C., Karalunas, S., Dieckmann, N. F., Nikolas, M., Nousen, E., Tipsord, J., Song, X., & Nigg, J. T. (2022). Prediction of Attention-Deficit/Hyperactivity Disorder Diagnosis Using Brief, Low-Cost Clinical Measures: A Competitive Model Evaluation. Clinical Psychological Science, 216770262211202. https://doi.org/10.1177/21677026221120236 DOI: https://doi.org/10.1101/2021.12.23.21268330

Martnez, A., Schmuck, C., Pereverzyev, S., Pirker, C., & Haltmeier, M. (2020). A machine learning framework for customer purchase prediction in the non-contractual setting. European Journal of Operational Research, 281(3), 588596. DOI: https://doi.org/10.1016/j.ejor.2018.04.034

Sleiman, R., Mazyad, A., Hamad, M., Tran, K.-P., & Thomassey, S. (2022). Forecasting Sales Profiles of Products in an Exceptional Context: COVID-19 Pandemic. International Journal of Computational Intelligence Systems, 15(1), 99. DOI: https://doi.org/10.1007/s44196-022-00161-x

Idriss, I. A., Cheng, W., & Hailu, Y. (2023). Weighted Maximum Likelihood Technique for Logistic Regression. Open Journal of Statistics, 13(06), 803821. DOI: https://doi.org/10.4236/ojs.2023.136041

Published
2025-06-30
How to Cite
Owoade, A. A., Bakare, O. I., & Ogunsanwo, G. O. (2025). INTEGRATING RANDOM FOREST AND LOGISTIC REGRESSION TO FORECAST CUSTOMER ATTRITION IN THE TELECOM SECTOR. FUDMA JOURNAL OF SCIENCES, 9(6), 80 - 89. https://doi.org/10.33003/fjs-2025-0906-3556