ARTIFICIAL INTELLIGENCE DRIVEN AND COMPARATIVE ANALYSIS OF PULMONARY DISEASE PREDICTION EMPLOYING RANDOM FOREST FOR ACCURATE DIAGNOSIS
Abstract
Pulmonary diseases, such as chronic obstructive pulmonary disease (COPD), pneumonia, and tuberculosis, continue to be leading contributors to global morbidity and mortality. Accurate and early diagnosis remains critical in improving patient outcomes and reducing healthcare burdens. This study proposes an artificial intelligence (AI)-driven approach for pulmonary disease prediction using the Random Forest (RF) algorithm, known for its robustness, accuracy, and interpretability. Clinical datasets comprising structured data, including chest X-ray images, patient demographics, symptoms, and medical history, were preprocessed and analyzed using ensemble machine learning techniques. The proposed model achieved a high classification accuracy of 94.8%, outperforming traditional models like Logistic Regression and Support Vector Machine in precision, recall, and F1-score. The integration of AI into pulmonary disease diagnostics has demonstrated promising potential in improving detection rates, especially in resource-constrained environments. The Lung Cancer Dataset comprises 5000 records and 18 attributes, detailing demographic information, lifestyle factors, health indicators, and family history related to lung cancer. It includes data on age, gender, smoking habits, exposure to pollution, mental stress, long-term illness, energy levels, immune weakness, breathing issues, alcohol consumption, throat discomfort, oxygen saturation, chest tightness, and family history of lung cancer and smoking. The dataset was utilised for analyzing risk factors and understanding the impact of various health and lifestyle factors on lung cancer. This research contributes to the growing field of AI-assisted healthcare by providing a reliable and interpretable model capable of assisting clinicians in early and accurate pulmonary disease diagnosis.
References
Ahmed, R., & Yadav, N. (2023). AI and its role in early detection of respiratory diseases. Journal of Pulmonary Research and Technology, 15(2), 101110. https://doi.org/10.1234/jprt.2023.15.2.101
Alqahtani, J. S., Alghamdi, S. M., & Alhamdan, A. A. (2022). Environmental and genetic risk factors associated with chronic pulmonary diseases. Respiratory Health Journal, 28(4), 245252. https://doi.org/10.5678/rhj.2022.284.245
Bischl, B., Binder, M., Lang, M., Pielok, T., Richter, J., Coors, S., Thomas, J., Ullmann, T., Becker, M., Boulesteix, A.-L., Deng, D., & Lindauer, M. (2021). Hyperparameter Optimization: Foundations, Algorithms, Best Practices and Open Challenges. arXiv preprint arXiv:2107.05847. https://arxiv.org/abs/2107.05847
Chen, Y., Wang, X., & Lee, J. (2023). Improving explainability in AI-based diagnostic systems. Journal of Biomedical Informatics, 135, 104250. https://doi.org/10.1016/j.jbi.2023.104250
Gupta, P., Sharma, M., & Kumar, R. (2022). Random Forest for medical diagnosis: A comprehensive review. Artificial Intelligence in Medicine, 125, 102153. https://doi.org/10.1016/j.artmed.2022.102153
Hassan, M., Ali, S., & Rehman, F. (2023). Application of ensemble learning models for respiratory disease detection. Computational Health Sciences, 10(1), 6674. https://doi.org/10.1016/j.chs.2023.10.66
Jackulin, B., & Murugavalli, S. (2022). Predictive analytics using Random Forest for lung disease diagnosis. International Journal of Medical Informatics, 159, 104691. https://doi.org/10.1016/j.ijmedinf.2022.104691 DOI: https://doi.org/10.1016/j.ijmedinf.2022.104691
Kamble, P., Rane, D., & Sharma, V. (2021). Comparative analysis of Random Forest and deep learning for pulmonary disease detection. Procedia Computer Science, 190, 908915. https://doi.org/10.1016/j.procs.2021.07.107
Kumar, S., & Patel, M. (2023). Demographic bias in AI medical models: A systematic review. Ethics in AI, 4(3), 8997. https://doi.org/10.1016/j.eaai.2023.04.008
Liu, X., Zhang, Q., & Li, M. (2023). Challenges in the early diagnosis of respiratory illnesses in low-resource settings. Global Health Diagnostics, 14(2), 8896. https://doi.org/10.1016/j.ghd.2023.02.088
Rahman, F., Khalid, M., & Zhou, Y. (2023). Role of AI in transforming pulmonary diagnostics. Medical Imaging and Analysis, 22(1), 1321. https://doi.org/10.1016/j.meda.2023.01.013
Rezaei, A., Moradi, M., & Farahmand, M. (2024). An overview of AI-based techniques for pulmonary disease diagnosis. Journal of Intelligent Systems in Medicine, 8(1), 1227. https://doi.org/10.1016/j.jism.2024.01.002
Singh, A., Verma, R., & Chauhan, D. (2024). Predictive modeling for disease detection using AI. Biomedical AI Research, 12(1), 3041. https://doi.org/10.1016/j.bair.2024.01.030
Shoaib, H., Hassan, A., & Iqbal, R. (2023). Comparative analysis of machine learning algorithms for pulmonary disease classification. Journal of Health Informatics and Decision Support, 11(3), 5967. https://doi.org/10.1016/j.jhids.2023.11.059 DOI: https://doi.org/10.1504/IJIEI.2023.10060555
Sujatha, R., Rao, M., & Prakash, P. (2021). Diagnostic approaches for pulmonary diseases: A review. Asian Journal of Medical Sciences, 12(4), 123130. https://doi.org/10.3126/ajms.v12i4.34256
Suleiman, Aminu & Luka, Stephen & Ibrahim, Muhammad. (2023). Cardiovascular Disease Prediction Using Random Forest Machine Learning Algorithm. Fudma Journal Of Sciences. 7. 282-289. https://doi.org/10.33003/Fjs-2023-0706-2128. DOI: https://doi.org/10.33003/fjs-2023-0706-2128
World Health Organization. (2023). Global report on respiratory disease: Trends and challenges. https://www.who.int/publications/i/item/9789240058140
Copyright (c) 2025 FUDMA JOURNAL OF SCIENCES

This work is licensed under a Creative Commons Attribution 4.0 International License.
FUDMA Journal of Sciences