ARTIFICIAL INTELLIGENCE DRIVEN AND COMPARATIVE ANALYSIS OF PULMONARY DISEASE PREDICTION EMPLOYING RANDOM FOREST FOR ACCURATE DIAGNOSIS

  • Rilwan Abdulyekeen Federal University Dutsin-Ma
Keywords: Machine Learning, Bio-Informatics, Random Forest, Cancer, Artificial Intelligence, Pulmonary diseases

Abstract

Pulmonary diseases, such as chronic obstructive pulmonary disease (COPD), pneumonia, and tuberculosis, continue to be leading contributors to global morbidity and mortality. Accurate and early diagnosis remains critical in improving patient outcomes and reducing healthcare burdens. This study proposes an artificial intelligence (AI)-driven approach for pulmonary disease prediction using the Random Forest (RF) algorithm, known for its robustness, accuracy, and interpretability. Clinical datasets comprising structured data, including chest X-ray images, patient demographics, symptoms, and medical history, were preprocessed and analyzed using ensemble machine learning techniques. The proposed model achieved a high classification accuracy of 94.8%, outperforming traditional models like Logistic Regression and Support Vector Machine in precision, recall, and F1-score. The integration of AI into pulmonary disease diagnostics has demonstrated promising potential in improving detection rates, especially in resource-constrained environments. The Lung Cancer Dataset comprises 5000 records and 18 attributes, detailing demographic information, lifestyle factors, health indicators, and family history related to lung cancer. It includes data on age, gender, smoking habits, exposure to pollution, mental stress, long-term illness, energy levels, immune weakness, breathing issues, alcohol consumption, throat discomfort, oxygen saturation, chest tightness, and family history of lung cancer and smoking. The dataset was utilised for analyzing risk factors and understanding the impact of various health and lifestyle factors on lung cancer. This research contributes to the growing field of AI-assisted healthcare by providing a reliable and interpretable model capable of assisting clinicians in early and accurate pulmonary disease diagnosis.

References

Ahmed, R., & Yadav, N. (2023). AI and its role in early detection of respiratory diseases. Journal of Pulmonary Research and Technology, 15(2), 101110. https://doi.org/10.1234/jprt.2023.15.2.101

Alqahtani, J. S., Alghamdi, S. M., & Alhamdan, A. A. (2022). Environmental and genetic risk factors associated with chronic pulmonary diseases. Respiratory Health Journal, 28(4), 245252. https://doi.org/10.5678/rhj.2022.284.245

Bischl, B., Binder, M., Lang, M., Pielok, T., Richter, J., Coors, S., Thomas, J., Ullmann, T., Becker, M., Boulesteix, A.-L., Deng, D., & Lindauer, M. (2021). Hyperparameter Optimization: Foundations, Algorithms, Best Practices and Open Challenges. arXiv preprint arXiv:2107.05847. https://arxiv.org/abs/2107.05847

Chen, Y., Wang, X., & Lee, J. (2023). Improving explainability in AI-based diagnostic systems. Journal of Biomedical Informatics, 135, 104250. https://doi.org/10.1016/j.jbi.2023.104250

Gupta, P., Sharma, M., & Kumar, R. (2022). Random Forest for medical diagnosis: A comprehensive review. Artificial Intelligence in Medicine, 125, 102153. https://doi.org/10.1016/j.artmed.2022.102153

Hassan, M., Ali, S., & Rehman, F. (2023). Application of ensemble learning models for respiratory disease detection. Computational Health Sciences, 10(1), 6674. https://doi.org/10.1016/j.chs.2023.10.66

Jackulin, B., & Murugavalli, S. (2022). Predictive analytics using Random Forest for lung disease diagnosis. International Journal of Medical Informatics, 159, 104691. https://doi.org/10.1016/j.ijmedinf.2022.104691 DOI: https://doi.org/10.1016/j.ijmedinf.2022.104691

Kamble, P., Rane, D., & Sharma, V. (2021). Comparative analysis of Random Forest and deep learning for pulmonary disease detection. Procedia Computer Science, 190, 908915. https://doi.org/10.1016/j.procs.2021.07.107

Kumar, S., & Patel, M. (2023). Demographic bias in AI medical models: A systematic review. Ethics in AI, 4(3), 8997. https://doi.org/10.1016/j.eaai.2023.04.008

Liu, X., Zhang, Q., & Li, M. (2023). Challenges in the early diagnosis of respiratory illnesses in low-resource settings. Global Health Diagnostics, 14(2), 8896. https://doi.org/10.1016/j.ghd.2023.02.088

Rahman, F., Khalid, M., & Zhou, Y. (2023). Role of AI in transforming pulmonary diagnostics. Medical Imaging and Analysis, 22(1), 1321. https://doi.org/10.1016/j.meda.2023.01.013

Rezaei, A., Moradi, M., & Farahmand, M. (2024). An overview of AI-based techniques for pulmonary disease diagnosis. Journal of Intelligent Systems in Medicine, 8(1), 1227. https://doi.org/10.1016/j.jism.2024.01.002

Singh, A., Verma, R., & Chauhan, D. (2024). Predictive modeling for disease detection using AI. Biomedical AI Research, 12(1), 3041. https://doi.org/10.1016/j.bair.2024.01.030

Shoaib, H., Hassan, A., & Iqbal, R. (2023). Comparative analysis of machine learning algorithms for pulmonary disease classification. Journal of Health Informatics and Decision Support, 11(3), 5967. https://doi.org/10.1016/j.jhids.2023.11.059 DOI: https://doi.org/10.1504/IJIEI.2023.10060555

Sujatha, R., Rao, M., & Prakash, P. (2021). Diagnostic approaches for pulmonary diseases: A review. Asian Journal of Medical Sciences, 12(4), 123130. https://doi.org/10.3126/ajms.v12i4.34256

Suleiman, Aminu & Luka, Stephen & Ibrahim, Muhammad. (2023). Cardiovascular Disease Prediction Using Random Forest Machine Learning Algorithm. Fudma Journal Of Sciences. 7. 282-289. https://doi.org/10.33003/Fjs-2023-0706-2128. DOI: https://doi.org/10.33003/fjs-2023-0706-2128

World Health Organization. (2023). Global report on respiratory disease: Trends and challenges. https://www.who.int/publications/i/item/9789240058140

Published
2025-04-29
How to Cite
Abdulyekeen, R. (2025). ARTIFICIAL INTELLIGENCE DRIVEN AND COMPARATIVE ANALYSIS OF PULMONARY DISEASE PREDICTION EMPLOYING RANDOM FOREST FOR ACCURATE DIAGNOSIS. FUDMA JOURNAL OF SCIENCES, 9, 229 - 235. https://doi.org/10.33003/fjs-2025-09(AHBSI)-3433