Performance Evaluation of Machine Learning Algorithm Using Chronic Kidney Disease (CKD) Dataset
Keywords:
CKD- Chronic Kidney Disease, EHR- Electronic Health Record, ML- Machine Learning, PCA- Principal Component Analysis, RF- Random Forest, SVM-Support Vector machineAbstract
Machine learning (ML) algorithms enable computers to recognize patterns and make predictions or decisions from data, rather than relying on explicit programming. This paper presents a predictive model for the early detection of CKD through ML. The study uses five years of Electronic Health Record (EHR) data from a diverse patient group. The dataset contains demographics, clinical history, lab results, medication information, and diagnostic codes. The research starts with 25 variables, in addition to the class property, and then reduces this to 15 by using Principal Component Analysis (PCA). This aims to reduce the number of parameters to find the best subset for identifying CKD. The research uses common ML algorithms—Support Vector Machines (SVM), Random Forest, and Logistic Regression—and assesses their ability to detect CKD early. When comparing the classification algorithms, Random Forest (RF) had the best accuracy, at 81.2967%.
Published
How to Cite
Issue
Section
Copyright (c) 2025 AbdulMalik Aliyu, Kabiru Abdulmumin, Bashar Abdullahi Abubakar, Abubakar Suleiman Abba, Abubakar Ibrahim

This work is licensed under a Creative Commons Attribution 4.0 International License.